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The invention provides methods and kits for producing specific binding pairs (sbp) members. Populations of polypeptide 
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ages (rgdp). At least one of the polypeptide chains is expressed as a fusion with a component of an rgdp which thereby displays 
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tide chain. The methods enable production of libraries of multimeric sbp members from a very large number of possible combina- 
tions. In one embodiment of the invention a method employs "chain shuffling" in the production of sbp members of desired spe- 
cificity for a counterpart sbp member. Selection procedures are also described. 
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METHODS FOR PRODUCING MEMBERS OF 
SPECIFIC BINDING PAIRS 

The present invention relates to methods for 
producing members of specific binding pairs. The present 
5 invention also relates to the biological binding 
molecules produced by these methods* 

Owing to their high specificity for a given 
antigen, the advent of monoclonal antibodies (Kohler, G. 
and Milstein C; 1975 Nature 256 : 495) represented a 
10 significant technical break-through with important 
consequences both scientifically and commercially. 

Monoclonal antibodies are traditionally made by 
establishing an immortal mammalian cell line which is 
derived from a single immunoglobulin producing cell 
15 secreting one form of a biologically functional antibody 
molecule with a particular specificity. Because the 
antibody- secreting mammalian cell line is immortal, the 
characteristics of the antibody are reproducible from 
batch to batch. The key properties of monoclonal 
20 antibodies are their specificity for a particular antigen 
and the reproducibility with which they can be 
manufactured . 

Structurally, the simplest antibody (igG) 
comprises four polypeptide chains, two heavy (H) chains 
25 and two light (L) chains inter- connected by disulphide 
bonds (see figure 1). The light chains exist in two 
distinct forms called kappa (K) and lambda (/). Each 
chain has a constant region (C) and a variable 
region (V). Each chain is organized into a series of 
30 domains. The light chains have two domains, 

corresponding to the C region and the other to the V 
region. The heavy chains have four domains, one 
corresponding to the V region and three domains (1,2 and 
3) in the C region. The antibody has two arms (each arm 
35 being a Fab region), each of which has a VL and a VH 

region associated with each other. It is this pair of V 
regions (VL and VH) that differ from one antibody to 
another (owing to amino acid sequence variations), and 
which together are responsible for recognising the 
40 antigen and providing an antigen binding site (ABS). In 
even more detail, each V region is made up from three 
complementarity determining regions (CDR) separated by 
four framework regions (FR). The CDR's are the most 
variable part of the variable regions, and they perform 
45 the critical antigen binding function. The CDR regions 
are derived from many potential germ line sequences via a 
complex process involving recombination, mutation and 
selection. 

It has been shown that the function of binding 
50 antigens can be performed by fragments of a whole 

antibody. Example binding fragments are (i) the Fab 
fragment consisting of the VL, VH, CL and CHI domains; 
( ii ) the Fd fragment consisting of the VH and CHI 
domains; (iii) the Fv fragment consisting of the VL and 
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VH domains of a single arm of an antibody, (iv) the dAb 
fraoment (Ward, E.S. et al., Nature 341/ 544-546 (1989) 
„Sc? consists of a VH domain; (v) isolated CDR regions; 
and (vi) F(ab') 2 fragments, a bival^t fra^ent comprising 
5 two Fab fragments linked by a disulphide bridge at the 
hinge region ^ ^ domains of the Fv fragment are 
coded for by separate genes, it has proved possible to 
make a synthetic linker that enables them to be made as a 
10 sinale protein chain (known as single chain Fv (scFv); 
SSI R P E et al., Science 242 423-426 (1988) Huston 
J.S. et al-, Proc. Natl. Acad. Sex., USA 85, 5879-5883 
(1988)) by recombinant methods. These s fFv fragments 
were assembled from genes from monoclonals that had been 

15 previously isolated. . . , 

Whilst monoclonal antibodies, their fragments and 
derivatives have been enormously advantageous, there are 
nevertheless a number of limitations associated with 

them. , 

20 Firstly, the therapeutic applications or 

monoclonal antibodies produced by human immortal cell 
lines holds great promise for the treatment of a wide 
range of diseases (Clinical Applications of Monoclonal 
Antibodies. Edited by E. S. Lennox. British Medical 

25 Bulletin 1984. Publishers Churchill Livingstone). 

Unfortunately, immortal antibody-producing human cell 
lines are very difficult to establish and they give low 
Yields of antibody (approximately 1 ug/ml). In contrast, 
equivalent rodent cell lines yield high amounts of 

30 antibody (approximately 100 pg/ml). However, the 

repeated administration of these foreign rodent proteins 
to humans can lead to harmful hypersensitivity reactions. 
In the main therefore, these rodent -derived monoclonal 
antibodies have limited therapeutic use. 

35 secondly, a key aspect in the isolation of 

monoclonal antibodies is how many different clones of 
antibody producing cells with different specificities, 
can be practically established and sampled compared to 
how many theoretically need to be sampled in order to 

40 isolate a cell producing antibody with the desired 
specificity characteristics (Milstein, C, Royal Soc. 
Croonian Lecture, Proc. R. Soc. London B. 239; 1-16, 
(1990)). For example, the number of different 
specificities expressed at any one time by lymphocytes of 

45 the murine immune system is thought to be approximately 
10 7 and this is only a small proportion of the potential 
repertoire of specificities. However, during the 
isolation of a typical antibody producing cell with a 
desired specificity, the investigator is only able to 

50 sample 10 3 to 10 4 individual specificities. The problem 
is worse in the human, where one has approximately 10 
lymphocyte specificities, with the limitation on sampling 
of 10 3 or 10* remaining. . 

This problem has been alleviated to some extent in 

55 laboratory animals by the use of immunisation regimes. 
Thus where one wants to produce monoclonal antibodies 
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having a specificity against a particular epitope,, an 
animal is immunised with an immunogen expressing that 
epitope. The animal will then mount an immune response 
against the immunogen and there will be a proliferation 
5 of lymphocytes which have specificity against the 

epitope. Owing to this proliferation of lymphocytes with 
the desired specificity, it becomes easier to detect them 
in the sampling procedure. However, this approach is not 
successful in all cases, as a suitable immunogen may not 
10 be available. Furthermore, where one wants to produce 
human monoclonal antibodies (eg for therapeutic 
administration as previously discussed ) , such an approach 
is not practically, or ethically, feasible. 

In the last few years, these problems have in 
15 part, been addressed by the application of recombinant 
DNA methods to the isolation and production of e.g. 
antibodies and fragments of antibodies with antigen 
binding ability, in bacteria such as E . coli . 

This simple substitution of immortalised cells 
20 with bacterial cells as the •factory 1 , considerably 
simplifies procedures for preparing large amounts of 
binding molecules. Furthermore, a recombinant production 
system allows scope for producing tailor-made antibodies 
and fragments thereof. For example, it is possible to 
25 produce chimaeric molecules with new combinations of 
binding and effector functions, humanised antibodies 
(e.g. murine' variable regions combined with human 
constant domains or murine -antibody CDRs grafted onto a 
human FR) and novel antigen- binding molecules. 
30 Furthermore, the use of polymerase chain reaction (PCR) 
amplification (Saiki, R.K., et al., Science 239 , 487-491 
(1988)) to isolate antibody producing sequences from 
cells (e.g. hybridomas and B cells) has great potential 
for speeding up the timescale under which specificities 
35 can be isolated. Amplified VH and VL genes are cloned 
directly into vectors for expression in bacteria or 
mammalian cells (Orlandi, R. , et al., 1989, Proc. Natl. 
Acad. Sci., USA 86, 3833-3837; Ward, E.S., et al., 1989 
supra; Larrick, J.W., et al., 1989, Biochem. Biophys. 
40 Res. Coramun. 160, 1250-1255; Sastry, L. et al., 1989, 
Proc. Natl. Acad. Sci., USA. , 86/ 5728-5732). Soluble 
antibody fragments secreted from bacteria are then 
screened for binding activities. 

However, like the production system based upon 
45 immortalised cells, the recombinant production system 
still suffers from the selection problems previously 
discussed and therefore relies on animal immunization to 
increase the proportion of cells with desired 
specificity. Furthermore, some of these techniques can 
50 exacerbate the screening problems. For example, large 
separate H and L chain libraries have been produced from 
immunized mice and combined together in a random 
combinatorial manner prior to screening (Huse, W.D. et 
al., 1989, Science 246, 1275-1281, W090/14443; WO90/14424 
55 and W090/14430). Crucially however, the information held 
within each cell, namely the original pairing of one L 
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chain with one H chain, is lost. This loses some, of the 
advantage gained by using immunization protocols in the 
animal. Currently, only libraries derived from single VH 
domains (dAbs; Ward, E.S., et al., 1989, supra.) do not 
5 suffer this drawback. However, because not all antibody 
VH domains are capable of binding antigen, more have to 
be screened. In addition, the problem of directly 
screening many different specificities in prokaryotes 
remains to be solved. 
10 Thus, there is a need for a screening system which 

ameliorates or overcomes one or more of the above or 
other problems. The ideal system would allow the 
sampling of very large numbers of specificities (eg 10 
and higher), rapid sorting at each cloning round, and 
15 rapid transfer of the genetic material coding for the 
binding molecule from one stage of the production 
process, to the next stage. 

The most attractive candidates for this type of 
screening, would be prokaryotic organisms (because they 
grow quickly, are relatively simple to manipulate and 
because large numbers of clones can be created) which 
express and display at their surface a functional binding 
domain eg. an antibody, receptor, enzyme etc. In the UK 
patent GB 2137631B methods for the co-expression in a 
single host cell of the variable H and L chain genes of 
immunoglobulins were disclosed. However, the protein was 
expressed intracellular ly and was insoluble. Further, 
the protein required extensive processing to generate 
antibody fragments with binding activity and this 
generated material with only a fraction of the binding 
activity expected for antibody fragments at this 
concentration. It has already been shown that antibody 
fragments can be secreted through bacterial membranes 
with the appropriate signal peptide (Skerra, A. and 
35 Pluckthun, A. 1988 Science 240 1038-1040; Better, M et al 
1988, Science 240 1041-1043) with a consequent increase 
in the binding activity of antibody fragments. These 
methods require screening of individual clones for 
binding activity in the same way as do mouse monoclonal 

40 antibodies. . 

It has not been shown however, how a functional 
binding domain eg an antibody, antibody fragment, 
receptor, enzyme etc can be held on the bacterial surface 
in a configuration which allows sampling of say its 

45 antigen binding properties and selection for clones with 
desirable properties. In large part, this is because the 
bacterial surface is a complex structure, and in the 
gram-negative organisms there is an outer wall which 
further complicates the position. Further, it has not 

50 been shown that eg an antibody domain will fold correctly 
when expressed as a fusion with a surface protein of 
bacteria or bacteriophage. 

Bacteriophage are attractive prokaryote related 
organisms for this type of screening. In general, their 

55 surface is a relatively simple structure, they can be 
grown easily in large numbers, they are amenable to the 
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practical handling involved in many potential mass 
screening programmes, and they carry genetic information 
for their own synthesis within a small , simple package. 
The difficulty has been to practically solve the problem 
5 of how to use bacteriophages in this manner. A Genex 
Corporation patent application number WO88/O6630 has 
proposed that the bacteriophage lambda would be a 
suitable vehicle for the expression of antibody 
molecules, but they do not provide a teaching which 
10 enables the general idea to be carried out. For example 
WO88/06630 does not demonstrate that any sequences: (a) 
have been expressed as a fusion with gene V; (b) have 
been expressed on the surface of lambda; and (c) have 
been expressed so that the protein retains biological 

15 activity. Furthermore there is no teaching on how to 
screen for suitable fusions. Also, since the lambda 
virions are assembled within the cell, the fusion protein 
would be expressed intracellularly and would be predicted 
to be inactive. Bass et al., in December 1990 describe 

20 deleting part of gene III of the filamentous 

bacteriophage M13 and inserting the coding sequence for 
human growth hormone (hGH) into the N- terminal site of 
the gene. The growth hormone displayed by M13 was shown 
to be functional. (Bass, S., et al. Proteins, 

25 Structure, Function and Genetics (1990) 8: 309-314). A 
functional copy of gene III was always present in 
addition, when this fusion was expressed. A Protein 
Engineering Corporation patent application W090/02809 
proposes the insertion of the coding sequence for bovine 

30 pancreatic trypsin inhibitor (BPTI) into gene VIII of 
M13. However, the proposal was not shown to be 
operative. For example, there is no demonstration of the 
expression of BPTI sequences as fusions with protein VIII 
and display on the surface of M13. Furthermore this 

35 document teaches that when a fusion is made with gene 
III, it is necessary to use a second synthetic copy of 
gene III, so that some unaltered gene III protein will be 
present. The embodiments of - the present application do 
not do this. In embodiments where phagemid is rescued 

40 with M13K07 gene III deletion phage, there is no 
unaltered gene III present. 

WO90/02809 also teaches that phagemids that do not 
contain the full genome of M13 and require rescue by 
coinfection with helper phage are not suitable for these 

45 purposes because coinfection could lead to recombination. 

In all embodiments where the present applicants 
have used phagemids, they have used a helper phage and 
the only sequences derived from filamentous bacteriophage 
in the phagemids are the origin of replication and gene 

50 III sequences. 

WO90/02809 also teaches that their process needed 
information such as nucleotide sequence of the starting 
molecule and its three-dimensioned structure. The use of 
a pre-existing repertoire of binding molecules to select 

55 for a binding member, such as is disclosed herein, for 
example using an immunoglobulin gene repertoire of 
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animals, was not disclosed. Further, they do not discuss 
favouring variegation of their binding molecules in 
natural blocks of variation such as CDRs of 
immunoglobulins, in order to favour generation of 
improved molecules and prevent unfavourable variations. 
WO90/02809 also specifically excluded the application of 
their process to the production of scFv molecules. 

In each of the above discussed patents (WO88/06630 
and WO90/02809), the protein proposed for display is a 
single polypeptide chain. There is no disclosure of a 
method for the display of a dimeric molecule by 
expression of one monomer as a fusion with a capsxd 
protein and the other protein in a free form. 

Another disclosure published in May 1991 descrxbes 
15 the insertion into gene VIII of M13, the coding sequences 
for one of the two chains of the Fab portion of an 
antibody with co-expression of the other from a plasmxd. 
The two chains were demonstrated as being expressed as a 
functional Fab fragment on the surface of the phage (Kang 
20 A S. et al., (1991) Proc. Natl. Acad. Sci, USA, 88 p4363- 
4366). No disclosure was made of the site of insertion 
into gene VIII and the assay for pAb binding activity by 
ELISA used a reagent specific for antibody L chain rather 
than for phage. A further disclosure published xn March 
25 1991 descrxbes the insertion of a fragment of the AIDS 
virus protein gag into the N-tenninal portion of gene III 
of the bacteriophage fd. The expression of the gag 
protein fragment was detected by immunologxcal methods, 
but it was not shown whether or not the protein was 
30 expressed in a functional form ( Tsunetsugu-Yokota Y et 
al. (1991) Gene 9£ p261-265). . 

The problem of how to use bacteriophages xn this 
way is in fact a difficult one. The protein must be 
inserted into the phage in such a way that the integrity 
35 of the phage coat is not undermined, and the protein 
itself should be functional retaining its biological 
activity with respect to antigen binding. Thus, where 
the protein of choice is an antibody, it should fold 
efficiently and correctly and be presented for antigen , 
40 binding. Solving the problem for antibody molecules and 
fragments would also provide a general method for any 
biomolecule which is a member of a specific binding pair 
e.g. receptor molecules and enzymes. 

Surprisingly, the applicants have been able to 
45 construct a bacteriophage that expresses and displays at 
its surface a large biologically functional binding 
molecule (eg antibody fragments, and enzymes and 
receptors) and which remains intact and infectious. This 
is described in WO 92/01047, the disclosure of which is 
50 herein incorporated by reference. Readers of the present 
document are urged to consult WO 92/01047 for detailed 
explanation of many of the procedures used in the 
experiments described herein. The applicants have called 
the structure which comprises a virus particle and a 
55 binding molecule displayed at the viral surface a 

'package'. Where the binding molecule is an antibody, an 
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antibody derivative or fragment, or a domain that is 
homologous to an immunoglobulin domain, the applicants 
call the package a 1 phage antibody 1 (pAb). However, 
except where the context demands otherwise, where the 
5 term phage antibody is used generally,, it should also be 
interpreted as referring to any package comprising a 
virus particle and a biologically functional binding 
molecule displayed at the viral surface. 

pAbs have a range of applications in selecting 

10 antibody genes encoding antigen binding activities. For 
example, pAbs could be used for the cloning and rescue of 
hybridomas (Orlandi, et al (1989) PNAS 86 p3833- 

3837 ), and in the screening of large combinatorial 
libraries (such as found in Huse, W.D. et al., 1989, 

15 Science 246 . 1275-1281). In particular, rounds of 
selection using pAbs may help in rescuing the higher 
affinity antibodies from the latter libraries. It may be 
preferable to screen small libraries derived from 
antigen-selected cells (Casali, P., et al., (1986) 

20 Science 234 p476-479) to rescue the original VH/VL pairs 
comprising the Fv region of an antibody. The use of pAbs 
may also allow the construction of entirely synthetic 
antibodies. Furthermore, antibodies may be made which 
have some synthetic sequences e.g. CDRs, and some 

25 naturally derived sequences. For example, V-gene 
repertoires could be made in vitro by combining un- 
rearranged V* genes, with D and J segments. Libraries of 
pAbs could then be selected by binding to antigen, 
hypermutated in vitro in the antigen-binding loops or V 

30 domain framework regions, and subjected to further rounds 
of selection and mutagenesis. 

As previously discussed, separate H and L chain 
libraries lose the original pairing between the chains. 
It is difficult to make and screen a large enough library 
35 for a particularly advantageous combination of H and L 
chains . 

For example, in a mouse there are approximately 10 7 
possible H chains and 10 7 possible L chains. Therefore, 
there are 10 14 possible combinations of H and L chains, 
40 and to test for anything like this number of combinations 
one would have to create and screen a library of about 
10 14 clones. 

The present invention provides approaches which 
ameliorate this problem. 

45 In one approach as large a library as is 

practically possible is created which expresses as many 
of the 10 14 potential combinations as possible. However, 
by virtue of the expression of the H and L chains on the 
surface of the phage, it is reasonably practicable to 

50 select the desired combination, from all the generated 
combinations by affinity techniques (see later for 
description of selection formats ) . 

In an approach (called a poly combinatorial 
approach by the present applicants), a large library is 

55 created from two smaller libraries for selection of the 
desired combination. The approach involves the creation 
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of: (i) a first library of say 10 7 e.g. H chains which are 
displayed on a bacteriophage (as a fusion with the 
protein encoded by gene III) which is resistant to e.g. 
tetracycline; and (ii) a second library of say 10 e.g. L 
5 chains in which the coding sequences for these light 
chains are within a plasmid vector and are expressed in 
the periplasmic space of a host bacterium. The first 
library is then used to infect the bacteria containing 
the second library to provide 10" combinations of H and L 

10 chains on the surface of the resulting phage in the 
bacterial supernatant. 

The advantage of this approach is that two 
separate libraries of eg 10 7 are created in order to 
produce 10" combinations. Creating a 10 library is a 

15 practical possibility. # 

The 10" combinations are then subjected to 
selection (see later for description of selection 
formats) as disclosed by the present application. This 
selection will then produce a population of phages 

20 displaying a particular combination of H and L chains 
having the desired specificity. The phages selected 
however, will only contain DNA encoding one partner of 
the paired H and L chains. The sample eluate containing 
the population is then divided into two portions. A 

25 first portion is grown on e.g. tetracycline plates to 
select those bacteriophage containing DNA encoding H 
chains which are involved in the desired antigen binding. 
A second portion is grown on e.g. ampicillin plates to 
select those bacteriophage containing phageraid DNA 

30 encoding L chains which are involved in the desired 
antigen binding. A set of colonies from individually 
isolated clones e.g. from the tetracycline plates are 
then used to infect specific colonies e.g. from the 
ampicillin plates. This results in bacteriophage 

35 expressing specific combinations of H and L chains which 
can then be assayed for antigen binding. 

In another approach (called a hierarchical dual 
combinational approach or chain shuffling by the present 
applicants), an individual colony from either the H or L 

40 chain clone selected by growth on the antibiotic plates, 
is used to infect a complete library of clones encoding 
the other chain (H or L). Selection is as described 
above. This favours isolation of the most favourable 
combination. 

45 phagemids have been mentioned above. The 

applicants have realised and demonstrated that in many 
cases phagemids will be preferred to phage for cloning 
antibodies because it is easier to use them to generate 
more comprehensive libraries of the immune repertoire. 

50 This is because the phagemid DNA is approximately 100 
times more efficient than bacteriophage DNA in 
transforming bacteria (see example 19 of WO 92/01047). 
Also, the use of phagemids gives the ability to vary the 
number of gene III binding molecule fusion proteins 

55 displayed on the surface of the bacteriophage (see 

example 17 of WO 92/01047). For example, in a system 
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comprising a bacterial cell containing a phageraid 
encoding a gene III fusion protein and infected with a 
helper phage, induction of expression of the gene III 
fusion protein to different extents, will determine the 
5 number of gene III fusion proteins present in the space 
defined between the inner and outer bacterial membranes 
following superinfection. This will determine the ratio 
of gene III fusion protein to native gene III protein 
displayed by the assembled phage. 
10 Expressing a single fusion protein per virion may 

aid selection of antibody specificities on the basis of 
affinity by avoiding the 'avidity' effect where a phage 
expressing two copies of a low affinity antibody would 
have the same apparent affinity as a phage expressing one 
15 copy of a higher affinity antibody. In some cases 

however, it will be important to display all the gene III 
molecules derived by superinfection of cells containing 
phagemids to have fusions (e.g. for selecting low 
affinity binding molecules or improving sensitivity on 
20 ELISA) . One way to do this is to superinfect with a 

bacteriophage which contains a defective gene III. The 
applicants have therefore developed and used a phage 
which is deleted in gene III, described in WO 92/01047. 
The demonstration that a functional antigen- 
25 binding domain can be displayed on the surface of phage, 
has implications beyond the construction of novel 
antibodies. ' For example, if other protein domains can be 
displayed at the surface of a phage, phage vectors could 
be used to clone and select genes by the binding 
30 properties of the displayed protein. Furthermore, 

variants of proteins, including epitope libraries built 
into the surface of the protein, could be made and 
readily selected for binding activities. In effect, 
other protein architectures might serve as "nouvelle" 
3 5 antibodies . 

The technique provides the possibility of building 
antibodies from first principles, taking advantage of the 
structural framework on which the antigen binding loops 
fold. In general, these loops have a limited number of 
40 conformations which generate a variety of binding sites 
by alternative loop combinations and by diverse side 
chains. Recent successes in modelling antigen binding 
sites augurs well for de novo design. In any case, a 
high resolution structure of the antigen is needed. 
45 However, the approach is attractive for making e.g. 

catalytic antibodies, particularly for small substrates. 
Here side chains or binding sites for prosthetic groups 
might be introduced, not only to bind selectively to the 
transition state of the substrate, but also to 
50 participate directly in bond making and breaking. The 
only question is whether the antibody architecture, 
specialised for binding, is the best starting point for 
building catalysts. Genuine enzyme architectures, such 
as the triose phosphate isomerase (TIM) barrel, might be 
55 more suitable. Like antibodies, TIM enzymes also have a 
framework structure (a barrel of B-strands and a-helices) 
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and loops to bind substrate. Many enzymes with a 
diversity of catalytic properties are based on this 
architecture and the loops might be manipulated 
independently on the frameworks for design of new 
5 catalytic and binding properties. The phage selection 
system as provided by the present disclosure can be used 
to select for antigen binding activities and the CDR 
loops thus selected, used on either an antibody framework 
or a TIM barrel framework. Loops placed on a e.g. a TIM 
10 barrel framework could be further modified by mutagenesis 
and subjected to further selection. Thus, there is no 
Led to select for high affinity binding activities in a 
single step. The strategy of the immune system, in which 
low affinity evolves to high affinity seems more 
15 realistic and can be mimicked using this invention. 

One class of molecules that could be useful in 
this type of application are receptors. For example, a 
specific receptor could be displayed on the surface of 
the phage such that it would bind its ligand. The 
receptor could then be modified by, for example, is yi£ro 



20 



mutagenesis and variants having higher binding affinity 
for the ligand selected. The selection may be carried 
out according to one or more of the formats described 
below 

25 * Alternatively, the phage-receptor could be used as 

the basis of a rapid screening system for the binding of 
ligands, altered ligands, or potential drug candidates. 
The advantages of this system namely of simple cloning, 
convenient expression, standard reagents and easy 

30 handling makes the drug screening application 

particularly attractive. In the context of this 
discussion, receptor means a molecule that binds a 
specific, or group of specific, ligand(s). The natural 
receptor could be expressed on the surface of a 

35 population of cells, or it could be the extracellular 
domain of such a molecule (whether such a form exists 
naturally or not), or a soluble molecule performing a 
natural binding function in the plasma, or within a cell 
or organ . 

40 Another possibility, is the display of an enzyme 

molecule or active site of an enzyme molecule on the 
surface of a phage (see examples 11,12,30,31,32 and 36 of 
W0 92/01047). Once the phage enzyme is expressed, it can 
be selected by affinity chromatography, for instance on 

45 columns derivatized with transition state analogues. If 
an enzyme with a different or modified specificity is 
desired, it may be possible to mutate an enzyme displayed 
as a fusion on bacteriophage and then select on a column 
derivatised with an analogue selected to have a higher 

50 affinity for an enzyme with the desired modified 
specificity- 

Although throughout this application, the 
applicants discuss the possibility of screening for 
higher affinity variants of pAbs, they recognise that in 

55 some applications, for example low affinity 

chromatography (Ohlson, S. et al Anal. Biochem. 169, 
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p204-208 (1988)), it may be desirable to isolate lower 
affinity variants. 

pAbs also allow the selection of antibodies for 
improved stability. It has been noted for many 
5 antibodies, that yield and stability are improved when 
the antibodies are expressed at 30 °C rather than 37 °C. 
If pAbs are displayed at 37 °C, only those which are 
stable will be available for affinity selection. When 
antibodies are to be used in vivo for therapeutic or 
10 diagnostic purposes, increased stability would extend the 
half-life of antibodies in circulation. 

Although stability is important for all antibodies 
and antibody domains selected using phage, it is 
particularly important for the selection of Fv fragments 
15 which are formed by the non-covalent association of VH 
and VL fragments. Fv fragments have a tendency to 
dissociate and have a much reduced half-life in 
circulation compared to whole antibodies. Fv fragments 
are displayed on the surface of phage, by the association 

20 of one chain expressed as a gene III protein fusion with 
the complementary chain expressed as a soluble fragment. 
If pairs of chains have a high tendency to dissociate, 
they will be much less likely to be selected as pAbs. 
Therefore, the population will be enriched for pairs 

25 which do associate stably. Although dissociation is less 
of a problem with Fab fragments, selection would also 
occur for Fab fragments which associate stably. pAbs 
allow selection for stability to protease attack , only 
those pAbs that are not cleaved by proteases will be 

30 capable of binding their ligand and therefore populations 
of phage will be enriched for those displaying stable 
antibody domains. 

The technique of displaying binding molecules on 
the phage surface can also be used as a primary cloning 

35 system. For example, a cDNA library can be constructed 
and inserted into the bacteriophage and this phage 
library screened for the ability to bind a ligand. The 
ligand/binding molecule combination could include any 
pair of molecules with an ability to specifically bind to 

40 one another e.g. receptor/ ligand, enzyme/ substrate (or 

analogue), nucleic acid binding protein/nucleic acid etc. 
If one member of the complementary pair is available, 
this may be a preferred way of isolating a clone for the 
other member of the pair. 

45 It will often be necessary to increase the 

diversity of a population of genes cloned for the display 
of their proteins on phage or to mutate an individual 
nucleotide sequence. Although in vitro or in vivo 
mutagenesis techniques could be used for either purpose, 

50 a particularly suitable method would be to use mutator 
strains. A mutator strain is a strain which contains a 
genetic defect which causes DNA replicated within it to 
be mutated with respect to its parent DNA. Hence if a 
population of genes as gene III fusions is introduced 

55 into these strains it will be further diversified and can 
then be transferred to a non-mutator strain, if desired, 
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for display and selection. 
Targeted gene transfer 

A useful and novel set of applications makes use 
of the binding protein on the phage to target the phage 
5 genome to a particular cell or group of cells. For 

example, a pAb specific for a cell surface molecule could 
be used to bind to the target cell via the surface 
molecule. The phage could then be internalised, either 
through the action of the receptor itself or as the 

10 result of another event (e.g. an electrical discharge 

such as in the technique of electroporation ) . The phage 
genome would then be expressed if the relevant control 
signals (for transcription and translation and possibly 
replication) were present. This would be particularly 

15 useful if the phage genome contained a sequence whose 

expression was desired in the target cell (along with the 
appropriate expression control sequences). A useful 
sequence might confer antibiotic resistance to the, 
recipient cell or label the cell by the expression of its 

20 product (e.g. if the sequence expressed a detectable gene 
product such as a lucif erase, see White, M, et al. 
Techniques 2(4), pl94-201 (1990)), or confer a particular 
property on the target cell (e.g. if the target cell was 
a tumour cell and the new sequence directed the 

25 expression of a tumour suppressing gene), or express an 
antisense construct designed to turn off a gene or set of 
genes in the" target cell, or a gene or gene product 
designed to be toxic to the target cell. 

Alternatively, the sequence whose expression is 

30 desired in the target cell can be encoded on a phagemid. 
The phagemid DNA may then be incorporated into a phage 
displaying an antibody specific for a cell surface 
receptor. For example, incorporation may be by 
superinfection of bacteria containing the phagemid, with 

35 a helper phage whose genome encodes the antibody fragment 
specific for the target cell. The package is then used 
to direct the phagemid to the target cell. 

This technique of "targeted gene transfer" has a 
number of uses in research and also in therapy and 

40 diagnostics. For example, gene therapy often aims to 

target the replacement gene to a specific cell type that 
is deficient in its activity. Targetting pAbs provide a 
means of achieving this. 

In diagnostics, phage specific for particular 

45 bacteria or groups of bacteria have been used to target 
marker genes, e.g. lucif erase, to the bacterial host 
(sec, for example, Ulitzer, S., and Kuhn, J., EPA 
85303913.9). If the host range of the phage is 
appropriate, only those bacteria that are being tested 

50 for, will be infected by the phage, express the 

lucif erase gene and be detected by the light they emit. 
This system has been used to detect the presence of 
Salmonella. One major problem with this approach is the 
initial isolation of a bacteriophage with the correct 

55 host range and then the cloning of a luciferase gene 
cassette into that phage, such that it is functional. 
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The pAb system allows the luciferase cassette to be 
cloned into a well characterised system (filamentous 
phage) and allows simple selection of an appropriate host 
range , by modifying the antibody (or other binding 
5 molecule) specificity that the pAb encodes. 

The present applicants have also been able to 
develop novel selection systems and assay formats which 
depend on the unique properties of these replicable 
genetic display packages e.g. pAbs. 
10 TERMINOLOGY 

Much of the terminology discussed in this section 
has been mentioned in the text where appropriate. 
Specific Binding Pair (sbp) 

This describes a pair of molecules (each being a 
15 member of a specific binding pair) which are naturally 
derived or synthetically produced. One of the pair of 
molecules, has an area on its surface, or a cavity which 
specifically binds to, and is therefore defined as 
complementary with a particular spatial and polar 
20 organisation of the other molecule, so that the pair have 
the property of binding specifically to each other. 
Examples of types of specific binding pairs are antigen- 
antibody, biotin-avidin, hormone-hormone receptor, 
receptor- ligand, enzyme-substrate, lgG-protein A. 
25 Multimeric Member 

This describes a first polypeptide which will 
associate with at least a second polypeptide, when the 
polypeptides are expressed in free form and/or on the 
surface of a substrate. The substrate may be provided by 
30 a bacteriophage. Where there are two associated 

polypeptides, the associated polypeptide complex is a 
dimer, where there are three, a trimer etc. The dimer, 
triraer, mul timer etc or the multimeric member may 
comprise a member of a specific binding pair. 
35 Example multimeric members are heavy domains based 

on an immunoglobulin molecule, light domains based on an 
immunoglobulin molecule, T-cell receptor subunits. 
Replicable Genetic Display Package (Radp) 

This describes a biological particle which has 
40 genetic information providing the particle with the 

ability to replicate. The particle can display on its 
surface at least part of a polypeptide. The polypeptide 
can be encoded by genetic information native to the 
particle and/or artificially placed into the particle or 
45 an ancestor of it. The displayed polypeptide may be any 
member of a specific binding pair eg. heavy or light 
chain domains based on an immunoglobulin molecule, an 
enzyme or a receptor etc. 

The particle may be a virus eg. a bacteriophage 
50 such as fd or M13. 
Package 

This describes a replicable genetic display 
package in which the particle is displaying a member of a 
specific binding pair at its surface. The package may be 
55 a bacteriophage which displays an antigen binding domain 
at its surface. This type of package has been called a 



WO 92/20791 



PCT/GB92/00883 



14 



phage antibody ( pAb ) . 

Antibody^ describes an immunoglobulin whether natural 
or partly or wholly synthetically produced. The term 
5 also covers any protein having a binding domain which is 
homologous to an immunoglobulin binding domain. These 
proteins can be derived from natural sources, or partly 
or wholly synthetically produced. , , 4< „ vl __- - 

Example antibodies are the immunoglobulin isotypes 

10 and the Fab, F(ab l ) a , scFv, Fv, dAb, Fd fragments. 

Immunoglo bulin Sunerfamilv 

This describes a family of polypeptides, the 
members of which have at least one domain with a 
structure related to that of the variable or constant 

15 domain of immunoglobulin molecules. The domain contains 
two B-sheets and usually a conserved disulphide bond (see 
A.F. Williams and A.N. Barclay 1988 Ann. Rev Immunol. 6 

381 " 4 ° 5 Example members of an immunoglobulin superfamily 
20 are CD4, platelet derived growth factor receptor ( PDGFR) , 
intercellular adhesion molecule. (ICAM). Except where 
the context otherwise dictates, reference to 
immunoglobulins and immunoglobulin homologs in this 
application includes members of the immunoglobulin 
25 superfamily and homologs thereof. 

Homologs . , . . ____ 

This term indicates polypeptides having the same 

or conserved residues at a corresponding position in 

their primary, secondary or tertiary structure. The term 
30 also extends to two or more nucleotide sequences encoding 

the homologous polypeptides . 

Example homologous peptides are the immunoglobulin 

isotypes . 

Functional ^ . , ^ . ■ 

35 m relation to a sbp member displayed on the 

surface of a rgdp, means that the sbp member is presented 
in a folded form in which its specific binding domain for 
its complementary sbp member is the same or closely 
analogous to its native configuration, whereby it 

40 exhibits similar specificity with respect *° t*® 

complementary sbp member. In this respect it differs 
from the peptides of Smith et al, supra, which do not 
have a definite folded configuration and can assume a 
variety of configurations determined by the complementary 

45 members with which they may be contacted. 
Genetically dive rse population 

In connection with sbp members or polypeptide 
components thereof, this is referring not only to 
diversity that can exist in the natural population of 

50 cells or organisms, but also diversity that can be 
created by artificial mutation in vitro or in yiyo. 

Mutation in vitro may for example, involve random 
mutagenesis using oligonucleotides having random 
mutations of the sequence desired to be varied. In yivg. 

55 mutagenesis may for example, use mutator strains of host 
microorganisms to harbour the DNA (see Example 38 of WO 
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92/01047), The word "population" itself may be used to 
denote a plurality of e.g. polypeptide chains, which are 
not genetically diverse i.e. they are all the same. 
Domain 

5 A domain is a part of a protein that is folded 

within itself and independently of other parts of the 
same protein and independently of a complementary binding 
member. 
Folded Unit 

10 This is a specific combination of an a-helix 

and/or B-strand and/or 13- turn structure. Domains and 
folded units contain structures that bring together amino 
acids that are not adjacent in the primary structure. 
Free Form 

15 This describes the state of a polypeptide which is 

not displayed by a replicable genetic display package. 

Conditionally Defective 

This describes a gene which does not express a 

particular polypeptide under one set of conditions, but 
20 expresses it under another set of conditions. An 

example, is a gene containing an amber mutation expressed 

in non-suppressing or suppressing hosts respectively. 

Alternatively, a gene may express a protein which 

is defective under one set of conditions, but not under 
25 another set. An example is a gene with a temperature 

sensitive mutation. 

Suppressible' Translational Stop Codon 

This describes a codon which allows the 
translation of nucleotide sequences downstream of the 
30 codon under one set of conditions, but under another set 
of conditions translation ends at the codon. Example of 
suppressible translational stop codons are the amber, 
ochre and opal codons. 
Mutator Strain 

35 This is a host cell which has a genetic defect 

which causes DNA replicated within it to be mutated with 
respect to its parent DNA. Example mutator strains are 
NR9046mutD5 and NR9046 mut Tl (see Example 38). 
Helper Phage 

40 This is a phage which is used to infect cells 

containing a defective phage genome and which functions 
to complement the defect. The defective phage genome can 
be a phagemid or a phage with some function encoding gene 
sequences removed. Examples of helper phages are M13K07, 

45 M13K07 gene III no. 3; and phage displaying or encoding 
a binding molecule fused to a capsid protein. 
Vector 

This is a DNA molecule, capable of replication in 
a host organism , into which a gene is inserted to 
50 construct a recombinant DNA molecule. 
Phage Vector 

This is a vector derived by modification of a 
phage genome, containing an origin of replication for a 
bacteriophage, but not one for a plasmid. 
55 Phagemid Vector 

This is a vector derived by modification of a 
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plasmid genome, containing an origin of replication for 
bacteriophage as well as the plasmid origin of 
replication . 

Secreted ^ describes a rgdp or moleC ule that associates 

with the member of a sbp displayed on * he K r 9 d P' } n " h ^ h 
the sbp member and/or the molecule, have been folded and 
the package assembled externally to the cellular cytosol. 
Bonprtnire of Rearranged immunoglobulin Genes 

— U a collection of naturally occurring nucleotides eg 

DNA sequences which encoded expressed immunoglobulin 
genes in an animal. The sequences are generated by the 
in vivo rearrangement of eg V, D and J segments for H 
chains and eg the V and J segments for L chains. 
Alternatively the sequences may be generated from a ceii 
Ylne^mmunised in vitro and in which the rearrangement m 
response to immunisation occurs intracellular ly. Tne 
word "repertoire" is used to indicate genetic diversity. 
Libr ary 

A collection of nucleotide eg DNA, sequences 

within clones; or a genetically diverse collection of 
polypeptides, or specific binding pair members, or 
polypeptides or sbp members displayed on rgdps capable of 
selection or screening to provide an individual 
25 polypeptide or sbp members or a mixed population of 
polypeptides or sbp members. 

go pj^ire n't Arti f Tp-iallv Rearranged Immunoglobulin 
G enes 

A collection of nucleotide eg DNA, sequences 

30 derived wholly or partly from a source other than the 

rearranged immunoglobulin sequences from an animal. Tnis 
may include for example, DNA sequences encoding VH 
domains by combining unrearranged V segments with D and J 
segments and DNA sequences encoding VL domains by 
35 combining V and J segments. , . , . 

Part or all of the DNA sequences may be derived by 

oligonucleotide synthesis . 
Secretory Leader Peptide 

This is a sequence of amino acids joined to the N- 

40 terminal end of a polypeptide and which directs movement 
of the polypeptide out of the cytosol. 

This is a solution used to breakdown the linkage 

between two molecules. The linkage can be a *on-covalent 
45 or covalent bond(s). The two molecules can be members o* 

a sbp. 
Derivative 

This is a substance which derived from a 

polypeptide which is encoded by the DNA within a selected 

50 rgdp. The derivative polypeptide may differ from the 
encoded polypeptide by the addition, deletion 
substitution or insertion of amino acids, or by the 
linkage of other molecules to the encoded polypetide. 
These changes may be made at the nucleotide or protein 

55 level. For example the encoded polypeptide may be a Fab 
fragment which is then linked to an Fc tail from another 
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source. Alternatively markers such as enzymes, 
flouresceins etc may be linked to eg Fab, scFv 
fragments. 

According to one aspect of the invention, there is 
5 provided a method of producing multimeric specific 
binding pair (sbp) members, which method comprises 
expressing from a vector in recombinant host organism 
cells a population of a first polypeptide chain of a 
specific binding pair member fused to a component of a 

10 secreted replicable genetic display package (rgdp) which 
thereby displays said polypeptide chains at the surface 
of rgdps, and combining said population with a population 
of a second polypeptide chain of said specific binding 
pair member by causing or allowing first and second 

15 polypeptide chains to come together to form a library of 
said multimeric specific binding pair members displayed 
by rgdps, said population of second polypeptide chains 
not being expressed from the same vector as said 
population of first polypeptide chains, at least one of 

20 said populations being genetically diverse and expressed 
from nucleic acid that is capable of being packaged using 
said rgdp component, whereby the genetic material of each 
said rgdp encodes a polypeptide chain of a said 
genetically diverse population. 

25 The first and second polypeptide chains may be expressed 
in the same host organism cell, or not expressed in the 
same host organism cells. In the latter case the 
population of second polypeptide chains may comprise a 
repertoire of polypeptides purified from a human or 

30 animal source. 

In a prefered method each said polypeptide chain 
is expressed from nucleic acid which is capable of being 
packaged as a rgdp using said component fusion product. 
The method may comprise introducing vectors 

35 capable of expressing a population of said first 

polypeptide chains into host organisms which express a 
population of said second polypeptide chains in free 
form, or introducing vectors capable of expressing a 
population of said second polypeptide chains in free form 

40 into host organisms which express a population of said 
first polypeptide chains. 

On the other hand each said second polypeptide 
chains may be each expressed as a fusion with a component 
of a rgdp which thereby displays said second polypeptide 

45 chains at the surface of rgdps. In other words, both 
first and second chains may be displayed on rgdps by 
fusion to, for example, a capsid problem. In a method 
where soluble chains are combined with fusions of 
polypeptide chains and rgdp component the following steps 

50 may be included: 

(a) forming an extracellular mixture of a 
population of soluble second polypeptide chains and rgdps 
displaying a population of first polypeptide chains; and 

(b) causing or allowing first and second 

55 polypeptide chains to come together to form the library 
of said multimeric specific binding pair members. 
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The extracellular mixture may be partially 
denatured before being renatured to cause or allow said 
first and second polypeptide chains to come together to 
form said library. The population of second polypeptide 
5 chains may comprise a repertoire of polypeptides purified 
from a human or animal source. 

The populations of said polypeptide chains may be 

derived from: 

(i) the repertoire of rearranged 

10 immunoglobulin genes of an animal immunised with 
complementary sbp member; 

(ii) the repertoire of rearranged 
immunoglobulin genes of an animal not immunised with 
complementary sbp member; 

15 (iii) a repertoire of an artificially rearranged 

immunoglobulin gene or genes; 

(iv) a repertoire of an immunoglobulxn 

homolog gene or genes; or ^ 

( V ) a repertoire of sequences derxved 

20 from a germ- line immunoglobulin gene or genes; 

(vi) a repertoire of an immunoglobulin 
gene or genes artificially mutated by the introduction of 
one or more point mutations. 

(vii) a mixture of any of (i), (ii), 
25 (iii), (iv), (v) and (vi). 

When a phage is used as rgdp it may be selected 
from the class I phages fd, M13, fl, I£l, Ike, ZJ/Z, Ff 
and the class II phages Xf, Pfl and Pf3. 

Following combination rgdps may be selected or 

30 screened to provide an individual sbp member or a mixed 
population of said sbp members associated in their 
respective rgdps with nucleic acid encoding a polypeptide 
chain thereof. The restricted population of at least one 
type of polypeptide chain provided in this way may then 

35 be used in a further dual combinational method in 

slection of an individual, or a restricted population of 
complementary chain. 

Nucleic acid taken from a restricted rgdp 
population encoding said first polypeptide chains may be 

40 introduced into a recombinant vector into which nucleic 
acid from a genetically diverse repertoire of nucleic 
acid encoding said second polypeptide chains is also 
introduced, or the nucleic acid taken from a restricted 
rgdp population encoding said second polypeptide chains 

45 may be introduced into a recombinant vector into which 
nucleic acid from a genetically diverse repertoire of 
nucleic acid encoding said first polypeptide chains is 
also introduced. 

The recombinant vector may be produced by 

50 intracellular recombination between two vectors and this 
may be promoted by inclusion in the vectors of sequences 
at which site-specific recombination will occur, such as 
loxP sequences obtainable from coliphage PI. Site- 
specific recombination may then be catalysed by Cre- 

55 recombinase, also obtainable from coliphage PI. 

The Cre-recombinase used may be expressible under 
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the control of a regulatable promoter. 

Production of a recombinant vector may be used to 
produce nucleic acid encoding a single chain Fv region 
derivative of an immunoglobulin resulting from 
5 recombination between first and second vectors. 

It may be desirable for the vector comprising 
nucleic acid encoding the first polypeptide chain to be a 
phage or phagemid while the vector comprising nucleic 
acid encoding the second polypeptide chain being a 
10 plasmid; or the vector comprising nucleic acid encoding 
the first polypeptide chain to be a plasmid while the 
vector comprising nucleic acid encoding the second 
polypeptide chain is a phage or phagemid. Then, the 
intracellular recombination may take place in a bacterial 
15 host which replicates plasmids preferentially over phages 
or phagemids, or which replicates phages or phagemids 
preferentially over plasmids. It may be advantageous to 
use a system wherein the preferential replication of one 
type of vector over the other is conditional. 
20 This is discussed later with reference to PolA 

strain of E.coli or of another grain-negative bacterium. 

The invention envisages also a method of producing 
multimeric specific binding pair (sbp) members, which 
method comprises 

25 (i) causing or allowing intracellular 

recombination between (a) first vectors comprising 
nucleic acid encoding a population of a fusion of a first 
polypeptide chain of a specific binding pair member and a 
component of a secreted replicable genetic display 

30 package (rgdp) and (b) second vectors comprising nucleic 
acid encoding a population of a second polypeptide chain 
of a specific binding pair member, at least one of said 
populations being genetically diverse, the recombination 
resulting in recombinant vectors each of which comprises 

35 nucleic acid encoding a said polypeptide fusion and a 
said second polypeptide chain and capable of being 
packaged using said rgdp component; and 

(ii) expressing said polypeptide fusions and 
said second polypeptide chains, producing rgdps which 

40 display at their surface said first and second 

polypeptide chains and which each comprise nucleic acid 
encoding a said first polypeptide chain and a said second 
polypeptide chain. 

This may be with or without a preliminary 

45 selection or restriction of one of the populations of 
second polypeptide chains by any other method according 
to the invention. 

An important aspect of the present invention 
provides a method of producing one or a selected 

50 population of multichain polypeptide members of a 
specific binding pair (sbp members) specific for a 
counterpart specific binding pair member of interest , 
which method comprises the following steps: 
( i ) expressing from a vector in recombinant host 

55 organism cells a genetically diverse population of a 

first polypeptide chain of said multichain protein, fused 



SUBSTITUTE SHEET 



WO 92/20791 



PCT/GB92/00883 



20 

to a component of a replicable genetic display package 
(rgdp) which thereby displays said polypeptide chains at 
the surface of rgdps; 

(ii) combining said population with a unique or 

5 restricted population of second polypeptide chaxns of 

said multichain sbp members, not being expressed from the 
same vector as said population of first polypeptide 
chains, said combining forming a library of saxd 
multichain sbp members displayed by rgdps, said 
10 genetically diverse population being expressed from 
nucleic acid which is capable of being packaged using 
said rgdp component, whereby the genetic material of each 
said rgdp encodes a said first polypeptide chain; 

(iii) selecting by affinity with said counterpart sbp 
15 member of interest multichain sbp members specific for 

said counterpart sbp member associated in their 
respective rgdps with nucleic acid encoding a said first 
polypeptide chain thereof; 

(iv) combining said first polypeptide chains of 

20 multichain sbp members selected in step (iii) with a 
genetically diverse population of second polypeptide 
chains of multichain sbp members, the said second 
polypeptide chains being fused to a component of a rgdp 
which thereby displays them at the surface of rgdps, the 

25 said combining in this step (iv) forming a library of 

multichain sbp members from which one or more multichain 
sbp members specific for said counterpart sbp member are 
selectable by affinity with it. 

These multichain sbp members may be antibodies, or 

30 other members of the immunoglobulin family, or binding 
fragments thereof, or any other multimeric sbp member. 
See elsewhere in this text for other examples. 

Advantages and benefits of such a method are 
discussed elsewhere in this application. This technique 

35 may be modified for "humanising" antibodies, optionally 
in combination with CDR grafting and perhaps with the use 
of chimaeric polypeptide chains. Useful chimaerics may 
comprise a variable domain derived from a non-human 
animal antibody specific for the antigen of interest and 

40 a human antibody domain, such as one comprising Cyl. A 
genetically diverse population of chimaeric second 
polypeptide chains may be used in step (ii) of the 
method. Each of said population of second polypeptide 
chains combined in step (iv) may be a human chain which 

45 comprises an imposed complementarity determining region 
(CDR) from a non-human animal antibody specific for said 
antigen. If said first polypeptide chains are 
immunoglobulin light chains and said second polypeptide 
chains are immunoglobulin heavy chains. Then it may be 

50 beneficial in a selection of a high specificity humanised 
antibody for the imposed CDR to be CDR3 . 

The invention encompasses kits for use in carrying 
out a method according to any aspect of the invention. A 
kit may have the following components in additional to 

55 ancillary components required for carrying out the 
method : 
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(i) a vector having the following features: 
(a) an origin of replication for single-stranded 
bacteriophage, (b) a restriction site for insertion of 
nucleic acid encoding or a polypeptide component of an 
5 sbp member, (c) said restriction site being in the 5' end 
region of the mature coding sequence of a phage capsid 
protein, and (d) with a secretory leader sequence 
upstream of said site which directs a fusion of the 
capsid protein and sbp polypeptide to the periplasmic 

10 space of a bacterial host; and (ii) another vector, 

having some or all of the features (a), (b) , (c) and (d) 
of the vector described in ( i ) . 

Another kit for use in carrying out a method 
according to one aspect of the invention may have the 

15 following components in addition to ancillary components 
required for carrying out the method: 

(i) a first vector having the following features: 
(a) a restriction site for insertion of nucleic acid 
encoding or a polypeptide component of an sbp member, (b) 

20 said restriction site being in the 5 1 end region of the 
mature coding sequence of a phage capsid protein, and (c) 
with a secretory leader sequence upstream of said site 
which directs a fusion of the capsid protein and sbp 
polypeptide to the periplasmic space of a bacterial host; 

25 and 

(ii) a second vector having a restriction site for 
insertion of nucleic acid encoding a second said 
polypeptide chain, 

(iii) at least one of the vectors having an origin of 
30 replication for single-stranded bacteriophage, and 

(iv) the vectors having sequences at which site- 
specific recombination will occur. 

In the above methods, the binding molecule may be 
an antibody, or a domain that is homologous to an 

35 immunoglobulin. The antibody or domain may be either 

naturally derived or synthetic or a combination of both. 
The domain may be a Fab, scFv, Fv dAb or Fd molecule. 
Alternatively, the binding molecule may be an enzyme or 
receptor or fragment, derivative or analogue of any such 

40 enzyme or receptor. Alternatively, the binding molecule 
may be a member of an immunoglobulin superfamily and 
which has a structural form based on an immunoglobulin 
molecule . 

The present invention also provides rgdps as 
45 defined above and members of specific binding pairs eg. 
binding molecules such as antibodies, enzymes, receptors, 
fragments and derivatives thereof, obtainable by use of 
any of the above defined methods. The derivatives may 
comprise members of the specific binding pairs fused to 
50 another molecule such as an enzyme or a Fc tail. 

The invention also includes kits for carrying out 
the methods hereof. The kits will include the necessary 
vectors. One such vector will typically have an origin 
of replication for single stranded bacteriophage and 
55 either contain the sbp member nucleic acid or have a 

restriction site for its insertion in the 5' end region 
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of the mature coding sequence of a phage capsid protein, 
and with a secretory leader coding sequence upstream of 
said site which directs a fusion of the capsid protein 
exogenous polypeptide to the periplasraic space. 
5 ' The restriction sites in the vectors are 

preferably those of enzymes which cut only rarely in 
protein coding sequences. 

The kit preferably includes a phagemid vector 
which may have the above characteristics, or may contain, 
10 or have a site for insertion, of sbp member nucleic acid 
for expression of the encoded polypeptide in free form. 

The kits will also contain ancillary components 
required for carrying out the method, the nature of such 
components depending of course on the particular method 

15 employed. . . , 

Useful ancillary components may comprise helper 
phage, PCR primers, and buffers and enzymes of various 
kinds . 

PCR primers and associated reagents for use where 
20 the sbp members are antibodies may have the following 
characteristics: . r ^ 

(i) primers having homology to the 5' end of the sense 
or anti-sense strand of sequences encoding domains 
of antibodies; and 
25 (ii) primers including tag sequences 5' to these 

homologous sequences which incorporate restriction 
sites' to allow insertion into vectors; together 
with sequences to allow assembly of amplified VH 
and VL regions to enable expression as Fv, scFv or 
30 Fab fragments. 

Also comprehended by the present invention is the 
provision of an intermediate product of a dual 
combinational method, comprising a selected or partially 
selected mixed population of vectors or specific binding 
35 pair members, such as antibodies, which can then be used 
in a further method of combination and selection. 

Buffers and enzymes are typically used to enable 
preparation of nucleotide sequences encoding Fv, scFv or 
Fab fragments derived from rearranged or unrearranged 
40 immunoglobulin genes according to the strategies 
described herein. 

The applicants have chosen the filamentous F- 
specific bacteriophages as an example of the type of 
phage which could provide a vehicle for the display of 
45 binding molecules e.g. antibodies and antibody fragments 
and derivatives thereof, on their surface and facilitate 
subsequent selection and manipulation. 

The F-specific phages (e.g. fl, fd and M13) have 
evolved a method of propagation which does not kill the 
50 host cell and they are used commonly as vehicles for 
recombinant DNA (Romberg, A., DNA Replication, W.H. 
Freeman and Co., San Francisco, 1980). The single 
stranded DNA genome (approximately 6.4 Kb) of fd is 
extruded through the bacterial membrane where it 
55 sequesters capsid sub-units, to produce mature virions. 
These virions are 6 nm in diameter, 1pm in length and 
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each contain approximately 2,800 molecules of the major 
coat protein encoded by viral gene VIII and four 
molecules of the adsorption molecule gene III protein 
(g3p) the latter is located at one end of the virion. 
5 The structure has been reviewed by Webster et al. ; 1978 
in The Single Stranded DNA Phages, 557-569, Cold Spring 
Harbor Laboratory Press. The gene III product is 
involved in the binding of the phage to the bacterial F- 
pilus • 

10 Although these phages do not kill their host 

during normal replication, disruption of some of their 
genes can lead to cell death (Kornberg, A., 1980 supra.) 
This places some restraint on their use. The applicants 
have recognized that gene III of phage fd is an 

15 attractive possibility for the insertion of biologically 
active foreign sequences. There are however, other 
candidate sites including for example gene VIII and gene 
VI. 

The protein itself is only a minor component of 

20 the phage coat and disruption of the gene does not lead 
to cell death (Smith, G. 1988/ Virology 167: 156-165). 
Furthermore, it is possible to insert some foreign 
sequences (with no biological function) into various 
positions within this gene (Smith, G. 1985 Science 228 : 

25 1315-1317., Parmley, S.F. and Smith, G.P. Gene: 73 (1988) 
p. 305-318., and de la Cruz, V.F., et al., 1988, J. Biol. 
Chem. , 263 : 4318-4322). Smith et al described the 
display of peptides on the outer surface of phage but 
they did not describe the display of protein domains. 

30 Peptides can adopt a range of structures which can be 

different when in free solution, than when bound to, for 
example, an antibody, or when forming part of a protein 
(Stanfield, R.I. et al., (1990) Science 248 . p712-7i9). 
Proteins in general have a well defined tertiary 

35 structure and perform their biological function only when 
adopting this structure. For example, the structure of 
the antibody D1.3 has been solved in the free form and 
when bound to antigen (Bhat, T.N. et al., (1990) Nature 
347, p483-485). The gross structure of the protein is 

40 identical in each instance with only minor variations 

around the binding site for the antigen. Other proteins 
have more substantial conformation 1 changes on binding of 
ligand, for instance the enzymes hexokinase and pyruvate 
dehydrogenase during their catalytic cycle, but they 

45 still retain their overall pattern of folding. This 

structural integrity is not confined to whole proteins, 
but is exhibited by protein domains. This leads to the 
concept of a folded unit which is part of a protein, 
often a domain, which has a well defined primary, 

50 secondary and tertiary structure and which retains the 

same overall folding pattern whether binding to a binding 
partner or not. The only gene sequence that Smith et 
al., described that was of sufficient size to encode a 
domain (a minimum of perhaps 50 amino acids) was a 335bp 

55 fragment of a B-galactosidase corresponding to 

nucleotides 861-1195 in the B-galactosidase gene sequence 
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(Pannley, S. + Smith, G.P. 1988 supra. This would encode 
112 amino acids of a much larger 380 amino acid domain. 
Therefore, prior to the present application, no 
substantially complete domain or folded unit had been 
5 displayed on phage. In these cases, although the 

infectivity of the virion was disrupted, the inserted 
sequences could be detected on the phage surface by use 
of e.g. antibodies. 

The protein encoded by gene III has several 

10 domains (Pratt, D., et al., 1969 Virology ||: 42-53. , 
Grant, R.A., et al., 1981, J. Biol. Chem. 2||: 539-546 
and Armstrong, J., et al., FEBS Lett. 135: 167-172 1981.) 
including: (i) a signal sequence that directs the protein 
to the cell membrane and which is then cleaved off; (") 

15 a domain that anchors the mature protein into the 

bacterial cell membrane (and also the phage coat); and 
(iii) a domain that specifically binds to the phage 
receptor, the F-pilus of the host bacterium. Short 
sequences derived from protein molecules have been 

20 inserted into two places within the mature molecule 
(Smith, G., 1985 supra., and Parmley, S.F. and Smith 
G.P., 1988 supra.). Namely, into an inter-domain region 
and also between amino acids 2 and 3 at the N-terminus. 
The insertion sites at the N-terminus were more 

25 successful in maintaining the structural integrity of the 
gene III protein and displaying the peptides on the 
surface of the phage. By use of antisera specific for 
the peptides, the peptides inserted into this position 
were shown to be on the surface of the phage. These 

30 authors were also able to purify the phage, using this 
property. However, the peptides expressed by the phage, 
did not possess measurable biological functions of their 

own. , . 

Retaining the biological function of a molecule 

35 when it is expressed in a radically different context to 
its natural state is difficult. The demands on the 
structure of the molecule are heavy. In contrast, 
retaining the ability to be bound by specific antisera is 
a passive process which imposes far less rigorous demands 

40 on the structure of the molecule. For example, it is the 
rule rather than the exception that polyclonal antisera 
will recognise totally denatured, and biologically 
inactive, proteins on Western blots (see for example, 
Harlow, E. and Lane, D., Antibodies, a Laboratory Manual, 

45 Cold Spring Harbor Laboratory Press 1988). Therefore, 

the insertion of peptides into a region that allows their 
structure to be probed with antisera teaches only that 
the region allows the inserted sequences to be exposed 
and does not teach that the region is suitable for the 

50 insertion of large sequences with demanding structural 
constraints for the display of a molecule with a 
biological or binding function. In particular, it does 
not teach that domains or folded units of proteins can be 
displayed from sequences inserted in this region. 

55 This experience with Western blots is a graphic 

practical demonstration which shows that retaining the 
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ability to be bound by specific antisera imposes far less 
rigorous demands on the structure of a ^JJ^tlflJ. than 
does folding for the retention of a biological faction. 
Studies have been carried out, in which E.coli 
5 have been manipulated to express the protein B-adrenergic 
receptor as a fusion with the outer membrane protein 
lamB. The B-adrenergic receptor was expressed in a 
functional form as determined by the presence of b ^ing 
aSivity. However, when an equivalent antibody fuj"" 
10 was made with lamB, the antibody fusion was toxic to the 

h ° St Ce £he applicants have investigated the possibility 
of inserting the gene coding sequence for biologically 
actSe antibody fragments into the gene III region of fd 

II express a large fusion protein. As ^parent fr om 
the previous discussion, this approach makes onerous 
demands on the functionality of the fusion protein. The 
insertion is large, encoding antibody fragments of at 
least 100-200 amino acids; the antibody derived domain 
must fold efficiently and correctly to display antigen- 
binding; and most of the functions of gene III must be 
retained. The applicants approach to the . con ;*^? n 0 f 
the fusion molecule was designed to minimise the risk of 
disrupting these functions. In an embodiment of the 

25 invention, the initial vector used was fd-tet (Zacher, 
A.M., et al., 1980, Gene 9, 127-140) a tetracycline 
resistant version of fd bacteriophage that can be 

propagated as a plasmid that confers tetracycline 

resistance to the infected E.coli host. The applicants 
chose to insert after the signal sequence of the f d gene 

III protein for several reasons. In particular, tne 
applicants chose to insert after amino acid 1 of the 
mature protein to retain the context for the signal 
peptidase cleavage. To retain the structure and function 
of gene III itself, the majority of the original amino 
acids are synthesized after the inserted immunoglobulin 
sequences. The inserted immunoglobulin sequences were 
designed to include residues from the switch region that 
links VH-VL to CH1-CL (Lesk, A., and Chothia, C. , Nature 

40 335, 188-190, 1988). 

Surprisingly, by manipulating gene III ot 
bacteriophage fd, the present applicants have been able 
to const?ucl a bacteriophage that displays on its surface 
large biologically functional antibody, enzyme, and 

45 receptor molecules whilst remaining intact and 

infectious. Furthermore, the phages bearing antibodies 
of desired specificity, can be selected from a background 
of phages not showing this specificity. 

The sequences coding for a population of antibody 

50 molecules and for insertion into the vector to give 
expression of antibody binding functions on the Phage 
surface can be derived from a variety of sources. For 
example, immunised or non-immunised rodents or humans, 
and from organs such as spleen and peripheral blood 

55 lymphocytes The coding sequences are derived from these 
sources by techniques familiar to those skilled in the 
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art (Orlandi, R-, et al., 1989 supra; Larrick J.W., et 
al., 1989 supra; Chiang, Y.L., et al., 1989 Bio 
Techniques 7, p. 360-366; Ward, E.S, et al., 1989 supra; 
Sastry, L. , et al., 1989 supra.) . . . ^ . 

5 The disclosure made by the present applicants is 

important and provides a significant breakthrough in the 
technology relating to the production of biological 
bfndSng molecules, their fragments and derivatives by the 
use of recombinant methods. 

10 m standard recombinant techniques for the 

production of antibodies, an expression vector containing 
Sequences coding for the antibody polypeptide chains is 
used to transform e.g. E.coli. The antibody polypeptides 
are expressed and detected by use of standard screening 

15 system?. When the screen detects an antibody polypeptide 
of the desired specificity, one has to return to the 
particular transformed E.coli expressing the desired 
antibody polypeptide. Furthermore, the vector containing 
the coding sequence for the desired antibody polypeptide 

20 then has to be isolated for use from E.coli in further 

processing steps. , . . 

In the present invention however, the desired 
antibody polypeptide when expressed, is already Packaged 
with iti gene coding sequence. This means- that when the 

25 an antibody polypeptide of desired specificity is 
selected, there is no need to return to the original 
culture for 'isolation of that sequence. Furthermore, in 
previous methods in standard recombinant techniques, each 
clone expressing antibody needs to be screened 

30 individually. The present application provides for the 
selection of clones expressing antibodies with desired 
properties and thus only requires screening of clones 
from an enriched pool. . _ 

Because a rgdp (eg a pAb) displays a member of a 

35 specific binding pair (eg. an antibody of monoclonal 
antigen-binding specificity) at the surface of a 
relatively simple replicable structure also containing 
the genetic information encoding the member, rgdps eg 
DAbs that bind to the complementary member of the 

40 Specific binding pair (eg antigen) can be recovered very 
efficiently by either eluting off the complementary 
member using for example diethylamine, high salt etc and 
infecting suitable bacteria, or by denaturing the 
structure, and specifically amplifying the sequences 

45 encoding the member using PCR. That is, there is no 

necessity to refer back to the original bacterial clone 

that gave rise to the pAb. t*-*,*-*** 
For some purposes, for example immunoprecipitation 
and some diagnostic tests, it is advantageous to use 

50 polyclonal antibodies or antibody fragments. The present 
invention allows this to be achieved by either selection 
of an enriched pool of pAbs with desired properties or by 
mixing individually isolated clones with desired 
properties. The antibodies or antibody fragments may 

55 then be expressed in soluble form if desired. Such a 
selected polyclonal pAb population can be grown from 



WO 92/20791 



PCT/GB92/00883 



27 

stocks of phage, bacteria containing phagemids or 
bacteria expressing soluble fragments derived from the 
selected polyclonal population. Thus a reagent 
equivalent to a polyclonal antiserum is created which can 
5 be replicated and routinely manufactured in culture 
without use of animals. 

SELECTION FORMATS AND AFFINITY MATURATION 

Individual rgdps eg pAbs expressing the desired 
specificity eg for an antigen, can be isolated from the 
10 complex library using the conventional screening 

techniques (e.g. as described in Harlow, E., and Lane, 
D., 1988, supra Gherardi, E et al. 1990. J. Immunol, 
meth. 126 p61-68). 

The applicants have also devised a series of novel 
15 selection techniques that are practicable only because of 
the unique properties of rgdps. The general outline of 
some screening procedures is illustrated in figure 15 
using pAbs as an example type of rgdp. 

The population/library of pAbs to be screened 
20 could be generated from immunised or other animals; or be 
created in vitro by mutagenising pre-existing phage 
antibodies (using techniques well-known in the art such 
as oligonucleotide directed mutagenesis (Sambrook, J., et 
al., 1989 Molecular Cloning a Laboratory Manual, Cold 
25 Spring Harbor Laboratory Press). This population can be 
screened in one or more of the formats described below 
with reference to figure 15, to derive those individual 
pAbs whose antigen binding properties are different from 
sample c. 
30 Binding Elution 

Figure 15(i) shows antigen (ag) bound to a solid 
surface (s) the solid surface (s) may be provided by a 
petri dish, chromatography beads, magnetic beads and the 
like. The population/library of pAbs is then passed over 
35 the ag, and those individuals p that bind are retained 
after washing, and optionally detected with detection 
system d. A detection system based upon anti-fd antisera 
is illustrated in more detail in example 4 of W0 
92/01047. If samples of bound population p are removed 
40 under increasingly stringent conditions, the binding 
affinity represented in each sample will increase. 
Conditions of increased stringency can be obtained, for 
example, by increasing the time of soaking or changing 
the pH of the soak solution, etc. 
45 Competition 

Referring to figure 15 (ii) antigen ag can 
be bound to a solid support s and bound to saturation by 
the original binding molecule c. If a population of 
mutant pAb (or a set of unrelated pAbs) is offered to the 

50 complex, only those that have higher affinity for antigen 
ag than c will bind. In most examples, only a minority 
of population c will be displaced by individuals from 
population p. If c is a traditional antibody molecule, 
all bound material can be recovered and bound p recovered 

55 by infecting suitable bacteria and/or by use of standard 
techniques such as PCR. 
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An advantageous application is where ag xs used as 
a receptor and c the corresponding ligand. The recovered 
bound population p is then related structurally to the 
receptor binding site/and or ligand. This type of 
5 specificity is known to be very useful in the 

pharmaceutical industry* 

Another advantageous application is where ag is an 
antibody and c its antigen. The recovered bound 
population p is then an anti-idiotype antibody which have 

10 numerous uses in research and the diagnostic and 

pharmaceutical industries. . 

At present it is difficult to select directly for 
anti-idiotype antibodies. pAbs would give the ability to 
do this directly by binding pAb libraries (eg a naive 

15 library) to B cells (which express antibodies on their 
surface) and isolating those phage that bound well. 

In some instances it may prove advantageous to 
pre-select population p. For example, in the anti- 
idiotype example above, p can be absorbed against a 

20 related antibody that does not bind the antigen. 

However, if c is a pAb, then either or both c and 
p can advantageously be marked in some way to both 
distinguish and select for bound p over bound c. This 
marking can be physical, for example, by pre-labellmg p 

25 with biotin; or more advantageously, genetic. For 

example c can be marked with an EcoB restriction site, 
whilst t> can be marked with an EcoK restriction site (see 
Carter, P. et al., 1985, Nucl. Acids Res. 13, 4431-4443). 
When bound p+c are eluted from the antigen and used to 

30 infect suitable bacteria, there is restriction (and thus 
no growth) of population c (i.e. EcoB restricting 
bacteria in this example). Any phage that grew, would be 
greatly enriched for those individuals from p with higher 
binding affinities. Alternatively, the genetic marking 

35 can be achieved by marking p with new sequences, which 
can be used to specifically amplify p from the mixture 

using p ^J^ ce bQund pAbs can be amplified using for 
example PCR or bacterial infection, it is also possible 
40 to rescue the desired specificity even when insufficient 
individuals are bound to allow detection via conventional 

techniques. , 

The preferred method for selection of a phage 
displaying a protein molecule with a desired specificity 

45 or affinity will often be elution from an affinity matrix 
with a ligand (eg example 21 of WO 92/01047). Elution 
with increasing concentrations of ligand should elute 
phage displaying binding molecules of increasing 
affinity. However, when eg a pAb binds to its antigen 

50 with high affinity or avidity (or another protein to its 
binding partner) it may not be possible to elute the pAb 
from an affinity matrix with molecule related to the 
antigen. Alternatively, there may be no suitable 
specific eluting molecule that can be prepared in 

55 sufficiently high concentration. In these cases it is 
necessary to use an elution method which is not specific 
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to eg the antigen-antibody complex. Some of the non- 
specific elution methods generally used reduce phage 
viability for instance , phage viability is reduced with 
time at pH12 (Rossomando, E.F. and Zinder N.D. J. 
5 Mol.Biol. 36 387-399 1968). There may be interactions 
between eg antibodies and affinity matrices which cannot 
be disrupted without completely removing phage 

* infectivity. In these cases a method is required to 
elute phage which does not rely on disruption of eg the 

10 antibody - antigen interaction. A method was therefore 

* devised which allows elution of bound pAbs under mild 
conditions (reduction of a di thiol group with 
dithiothreitol ) which do not disrupt phage structure 
(example 47 of WO 92/01047). 

15 This elution procedure is just one example of an 

elution procedure under mild conditions. A particularly 
advantageous method would be to introduce a nucleotide 
sequence encoding amino acids constituting a recognition 
site for cleavage by a highly specific protease between 
20 the foreign gene inserted, in this instance a gene for an 
antibody fragment, and the sequence of the remainder of 
gene III. Examples of such highly specific proteases are 
Factor X and thrombin. After binding of the phage to an 
affinity matrix and elution to remove non-specific 
25 binding phage and weak binding phage, the strongly bound 
phage would be removed by washing the column with 
protease under conditions suitable for digestion at the 
cleavage site. This would cleave the antibody fragment 
from the phage particle eluting the phage. These phage 
30 would be expected to be infective, since the only 

protease site should be the one specifically introduced. 
Strongly binding phage could then be recovered by 
infecting eg. E.coli TGI cells. 

An alternative procedure to the above is to take 
35 the affinity matrix which has retained the strongly bound 
pAb and extract the DNA, for example by boiling in SDS 
solution. Extracted DNA can then be used to directly 
transform E.coli host cells or alternatively the antibody 
encoding sequences can be amplified, for example using 
40 PCR with suitable primers such as those disclosed herein, 
and then inserted into a vector for expression as a 
soluble antibody for further study or a pAb for further 
rounds of selection. 

Another preferred method for selection according 
45 to affinity would be by binding to an affinity matrix 
containing low amounts of ligand. 

If one wishes to select from a population of 
phages displaying a protein molecule with a high affinity 
for its ligand, a preferred strategy is to bind a 
50 population of phage to an affinity matrix which contains 
a low amount of ligand. There is competition between 
phage, displaying high affinity and low affinity 
proteins, for binding to the ligand on the matrix. Phage 
displaying high affinity protein is preferentially bound 
55 and low affinity protein is washed away. The high 

affinity protein is then recovered by elution with the 
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ligand or by other procedures which elute the phage from 
thi affinity matrix (example 35 of WO 92/01047 
demonstrates this procedure). n „ kaoed DNA 

In summary then, for recovery of the packaged DNA 
from the affinity step, the package can be simply eluted, 
it can be eluted in the presence of a homologous sbp 
member which competes with said package for binding to a 
complementary sbp member; it could be removed by boiling, 
it could be removed by proteolytic cleavage of the 
protein; and other methods will be apparent to those 
Skilled in the art eg. destroying the link between the 
substrate and complementary sbp member to release said 
packaged DNA and sbp member. At any rate the objective 
is to obtain the DNA from the package so that it can be 
15 used directly or indirectly, to express the sbp member 
encoded thereby. , . _ 

The efficiency of this selection procedure for 
pAbs and the ability to create very large libraries means 
that the immunisation techniques developed to increase 
20 the proportion of screened cells producing antibodies of 
interest will not be an absolute requirement. The 
technique allows the rapid isolation of binding 
specificities eg antigen-binding specificities, including 
those that would be difficult or even unobtainable by 
25 conventional techniques, for example, ^f^f^LSS" 
idiotypic antibodies. Removal of the animal altogether 
is now possible, once a complete library of the immune 
repertoire has been constructed. , . niilllhor 

The structure of the pAb molecule can be used in a number 
of other applications, some examples of which are: 
Signal Am plification an 
Acting as a molecular entity in itself, rgdps eg 
pAbs combine the ability to bind a specific molecule eg 
antigen with amplification, if the major coat protein is 
used to attach another moiety. This moiety can be 
attached via immunological, chemical, or any other means 
and can be used, for example, to label the complex with 
detection reagents or cytotoxic molecules for use in vivo 
or in vitro . 
40 Physical Detection 

The size of the rgdps eg pAbs can be used as a 
marker particularly with respect to physical methods of 
detection such as electron microscopy and/or some 
biosensors, e.g. surface plasmon resonance. 
45 Diagnostic Assays 

The rgdps eg pAbs also have advantageous uses in 

diagnostic assays, particularly where separation can be 
effected using their physical properties for example 
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centrifugation, filtration etc. 

In order that the invention is more fully 
understood, embodiments will be described in more detail 
by way of example only and not by way of limitation with 
reference to the figures described below. 

BRIEF DESCRIPTION O F THE FIGURES 

Fig . i shows plots of the probability of isolating 
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an antibody with a given p[K] value against the size of a 
library. 

Fig. 2 outlines a strategy to clone heavy chain as 
g3 fusion on phage, light change being expressed as 
5 soluble fragments from a phagemid. 

Fig. 3 shows a cloning strategy wherein only one 
of the two replicons is capable of being packaged into a 
filamentous particle. 

Fig. 4 shows various possible combinations of 
10 heavy and light chains, gene 3 fusions, and replicons in 
polycombinantorial libraries . 

Fig. 5 shows a strategy for cloning heavy chains 
as g3 fusions on phage with combination with purified 
light chains in vitro. 
15 ^ Fig. 6 illustrates the use of sites specific 
recombination for construction of polycombinantorial 
libraries. 

Fig. 7 shows the sequence of the template clone 
used in example 1. This is Aab NQ10.12.5 (Hoogenboom et 

20 al 1991, supra). 

Fig. 8 illustrates a strategy for cloning heavy 
and light chains as separate elements. 

Fig. 9 shows the sequence of polylinker used in 
pUC19 and pUC119 derivatives in example 1. 
25 Fig. 10 shows the results of the infection 

experiments described in example 1, illustrating 
interference" between phage and phagemid vectors. 

Fig. 11 illustrates the effect on ELISA signal of 
interference between phage and phagemid vectors, compared 
30 with phage and plasmid. 

Fig. 12 shows ELISA results which show that only 
when the correct heavy and light chain combination is 
used is a functional antibody produced, as demonstrated 
in example 3. 

35 Fig. 13 shows an example of a scheme for 

humanising a mouse monoclonal antibody. 

Fig. 14 shows the basic structure of the simplest 
antibody molecule IgG. 

Fig. 15 shows schematically selection techniques 

40 which utilise the unique properties of pAbs; 15(i) shows 
a binding/elution system; and 15(ii) shows a competition 
system (p=pAb; ag=antigen to which binding by pAb is 
required; c=competitor population e.g. antibody, pAb, 
ligands; s=substrate (e.g. plastic beads etc); 

45 d=detection system. 

Fig. 16 shows the sequence around the cloning site 
in gene III of fd D0G1. Restriction enzyme sites are 
shown as well as the amino acids encoded by antibody 
derived sequences. These are flanked at the 5* end by 

50 the gene III signal peptide and at the 3' end by 3 

alanine residues (encoded by the Not 1 restriction site) 
and the remainder of the mature gene III protein. The 
arrow shows the cleavage site for cutting of the signal 
peptide. 

55 Fig. 17 shows a) the phagemid pHENl a derivative 

of pUC119 described in example 24; and b) the cloning 
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sites in the phageraid pHEN. 

Fig. 18. The antibody constructs cloned into fd-DOGl 
and pHBNl for display on the surface of phage. 
5 Constructs I, II, III and IV were cloned into both fd- 
DOGl (as ApaLI-NotI fragments) and pHENl (as Sfil-NotI 
fragments) and pHENl (as Sfil-NotI fragments). All the 
constructs contained the heavy chain (VH) and light chain 
(VK) variable regions of the mouse anti-phOx antibody 
10 NQ10.12.5. The constant domains were human CK and CHI 
(tf 1 isotype). 

Fig. 19. Three ways of displaying antibody 
fragments on the surface of phage by fusion to gene III 
15 protein. 

Disclosed here are methods for preparing extremely 
diverse libraries of antibody heavy and light chains. 
Heavy and light chains are cloned on separate replicons 

20 and functional antibody produced by post-translational 
assembly of heavy and light chains in vivo or in vitro, 
such that the final number of combinations created is the 
number of heavy chains multiplied by the number of light 
chains. Such a format is also convenient for chain- 

25 shuffling, mutagenesis, humanising and CDR 'imprinting'. 
These methods can also be applied to other proteins in 
which two or more different subunits assemble to create a 
functional oligomer. 

30 The first functional antibody molecules to be 

expressed on the surface of filamentous phage were 
single-chain Fv's (scFv), so-called because heavy and 
light chain variable domains, normally on two separate 
proteins, are covalently joined by a flexible linker 

35 peptide. Alternative expression strategies have also 
been successful. Monomeric Fab molecules can be 
displayed on phage if one of the chains (heavy or light) 
is fused to g3 capsid protein and the complementary chain 
exported to the periplasm as a soluble molecule. The two 

40 chains can be encoded on the same or on different 
replicons; the important point is that the two antibody 
chains assemble post-translationally and the dimer is 
incorporated into the phage particle via linkage of one 
of the chains to g3p. 

45 

More recent cloning experiments have been performed 
with 'phagemid' vectors which have ca. 100-fold higher 
transformation efficiencies than phage DNA. These are 
plasmids containing the intergenic region from 
50 filamentous phages which enables single-stranded copies 
of the phagemid DNA to be produced, and packaged into 
infectious filamentous particles when cells harbouring 
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them are infected with 'helper 1 phages providing the 
phage components in trans . When phagemids contain gill 
fused to an antibody gene (e.g. pHEN-1), the resulting 
fusion protein is displayed on the phagemid particle 
5 (Hoogenboom, H.R. , A.D. Griffiths, K.S. Johnson, D.J. 
Chiswell, P. Hudson and G. Winter. (1991). Multi-subunit 
proteins on the surface of filamentous phage: 
methodologies for displaying antibody (Fab) heavy and 
light chains. Nucleic Acids Res . 19 (15), 4133-4137). 
10 Considerable progress has been made in developing 
efficient strategies for cloning antibody genes, a factor 
which becomes most important when dealing with large 
numbers of different antibody fragments such as 
repertoires • 

15 

The cloning vector fd-DOG-1 was used in early work 
with phage antibody repertoires in which scFv fragments 
were derived from spleen m RNA of mice immunised with the 
hapten oxazalone (Clackson, T., H.R. Hoogenboom, A.D. 

20 Griffiths and G. Winter. (1991). Making antibody 
fragments using phage display libraries. Nature 352, 624- 
628); VH and VL domains were separately amplified then 
linked at random via a short DNA fragment encoding the 
scFv linker peptide to produce a library of approximately 

25 10 5 different clones. This was panned against the 
immunising "antigen to select combinations of VH and VL 
which produced functional antibodies. Several binders 
were isolated, one in particular having an affinity not 
far below that of the best monoclonal anitbodies produced 

30 by conventional hybridoma technology. 

In a mouse, at any one time there are approximately 
10 7 possible H chains and 10 5 possible L chains, making a 
total of 10 12 possible VH:VL combinations when the two 

35 chains are combined at random (these figures are 
estimates and simply provide a rough guide to repertoire 
size). By these figures, the above mouse library sampled 
only 1 in 10 7 of the possible VH:VL combinations. It is 
likely that good affinity antibodies were isolated 

40 because the spleen cells derived from an immunised donor, 
in which B cells capable of recognising the antigen are 
clonally expanded and producing large quantities of Ig 
mRNA. The low library complexity in this experiment is 
partly due to the intrinsically low transformation 

45 efficiency of phage DNA compared to plasmid (or 
phagemid ) . 

Marks et al. (Marks, J.D. Hoogenboom, H.R., Bonnert, 
T.P., McCafferty, J., Griffiths, A.D. and Winter, G. 
50 (1991) By-passing immunization: Human antibodies from V- 
gene libraries displayed on phage. J.Mol.Biol. 222, 581- 
597) and PCT/GB91/01134 describe construction of an 
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antibody repertoire from unimmunised humans cloned in the 
phagemid pHEN-1. This library, consisting of 3.10' 
clones has so far yielded specific antibodies to ten 
different antigens. These antibodies have the moderate 
5 affinities expected of a primary immune response, 
demonstrating that usable antibodies to a range of 
structurally diverse antigens can indeed be isolated from 
a single resource. 

10 New binders can be created from clones isolated from 

phage antibody libraries using a procedure called 'chain- 
shuffling'. In this process one of the two chains is 
fixed and the other varied. For example, by fixing the 
heavy chain from the highest affinity mouse anti-OX phage 
15 antibody and recloning the repertoire of light chains 
alongside it, libraries of 4.10 7 were constructed. 
Several new OX-binders were isolated, and the majority of 
these had light chains that were distinct from those 
first isolated and considerably more diverse. These 
observations reflect the fact that a small library is 
sufficient to tap the available diversity when only one 
chain is varied, a useful procedure if the original 
library was not sufficiently large to contain the 
available diversity. 

The size of the library is of critical importance. 
This is especially true when attempting to isolate 
antibodies from a naive human repertoire, but is equally 
relevant to isolation of the highest affinity antibodies 
30 from an immunised source. 

It is clear that while phage display is an 
exceptionally powerful tool for cloning and selecting 
antibody genes, we are tapping only the tiniest fraction 

35 of the potential diversity using existing technology. 
Transformation efficiencies place the greatest limitation 
on library size with 10 9 being about the limit using 
current methods. Rough calculations suggest that this is 
several orders of magnitude below the target efficiency; 

40 more rigourous analysis confirms it. 

Perelson and Oster have given theoretical 
consideration to the relationship between size of the 
immune repertoire and the likelihood of generating an 
antibody capable of recognising a given epitope with 
greater than a certain threshold affinity, K. The 
relationship is described by the equation: 



20 



25 



45 



50 



P=e -N(p[K]) 

where P- probability that an epitope is not 
recognised with an affinity above the threshold 
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value K by any antibody in the repertoire- 

N= number of different antibodies in the 
repertoire. 

5 

and p[K]= probability that an individual antibody 
recognises a random epitope with an affinity 
above the threshold value K. 



10 In this analysis p[K] is inversely proportional to 

affinity, although an algorithm describing this 
relationship precisely has not been deduced. Despite 
this, it is apparent that the higher the affinity of the 
antibody, the lower its p[K] and the larger the 

15 repertoire needs to be to achieve a reasonable 
probability of isolating the antibody. The other 
important feature is that the function is exponential; as 
shown in fig.l, a small change in library size can have 
either a negligible or a dramatic effect on the 

20 probability of isolating an antibody with a given p[K] 
value, depending upon what point on the curve is given by 
the library size. 

The applicants have realised that the limitations of 
25 transformation efficiency (and therefore the upper limit 
on library 'size) can be overcome by efficient methods of 
introducing DNA into cells. In the preferred 
configuration, heavy and light chain genes are cloned 
separately on two different replicons, at least one of 
30 which is capable of being incorporated into a filamentous 
particle. Infectious particles carrying one chain are 
infected into cells harbouring the complementary chain; 
infection frequencies of >90% can be readily achieved. 
Heavy and light chains are then able to associate post- 
35 translationally in the periplasm and the combination 
displayed on the surface of the filamentous particle by 
virtue of one or both chains being connected to g3p. For 
example, a library of 10 7 heavy chains is cloned as an 
unfused population in a phagemid, and 10 7 light chains 
40 are cloned as g3 fusions in fd-DOG-1. Both populations 
are then expanded by growth such that there are 10 7 of 
each heavy chain-containing cell and 10 7 copies of each 
light chain phage. By allowing the phage to infect the 
cells, 10 7 x 10 7 = 10 14 unique combinations can be 
45 created, because there are 10 7 cells carrying the same 
heavy chain which can each be infected by 10 ' phage 
carrying different light chains. When this is repeated 
for each different heavy chain clone then one ends up 
with up to 10 14 different heavy/light combinations in 
50 different cells. This strategy is outlined in fig. 2, 
which shows the heavy chain cloned as g3 fusions on phage 
and the light chains expressed as soluble fragments from 
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a phagemid. Clearly, the reverse combination, light 
chains on phage, heavy chain on phagemid, is also 
tenable. 

5 In this configuration shown in fig. 2, fd-DOG 

•rescues' the phagemid so that both phage and phagemid 
DNA is packaged into filamentous particles, and both 
types will have paired heavy and light chains on their 
surface, despite having the genetic information for only 

10 one of them. For a given antigen or epitope, the vast 
majority of the heavy and light chain pairings will be 
non- functional, so that selection on antigen will have 
the effect of vastly reducing the complexity of the heavy 
and light chain populations. After the first round of 

15 selection the clones are re-assorted, in this example by 
infecting fresh host cells and selecting for both 
replicons. After several rounds of antigen selection and 
recovery of the two replicons, the considerably reduced 
heavy and light chain populations can be cloned onto the 

20 same replicon and analysed by conventional means. One 
technical problem with this arrangement is so-called 
'interference' between filamentous phage origins of 
replication carried on different replicons as a result of 
competition for the same replication machinery. This 

25 problem can be ameliorated by construction of 
•interference-resistant' mutants of either phage and/or 
phagemid origins (Johnston, S. and Ray, D.S. Interference 
between M13 and oriMl3 plasmids is mediated by a 
replication enhancer sequence near the viral strand 

30 origin. (1984) J.Mol.Biol. 177, 685-700) or through 
control of copy number e.g. by replacing the origin of 
double-stranded replication on the phagemid (distinct 
from filamentous phage intergenic region) with that of, 
for example, a temperature sensitive runaway replicon. 

35 in this way the copy number of the resident phagemid can 
be kept down to minimise interference so that the phage 
can establish, then the phagemid copy number allowed to 
increase for expression of the antibody. 

40 Alternatively, only one of the two replicons need be 

capable of being packaged into a filamentous particle. 
Such a strategy is outlined schematically in fig. 3 and 
reduced to practice in example 2. A library of light 
chains is cloned in the plasmid pUC19 and the heavy 

45 chains are expressed as g3 fusions in fd-DOG-l. There is 
no interference in this case since the replication 
mechanisms are distinct. The main operational difference 
here is that the process results in selection of, in this 
case, the best heavy chains; the light chains are not 

50 cloned. The appropriate light chains are isolated later 
when the selected heavy chains are cloned together with 
the repertoire of light chains on the same replicon, then 
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selected conventionally. 

Again, this principle can be translated into an 
array of alternative formats with different combinations 
5 of vectors, chains and g3 fusions as shown in figure 4. 

Another configuration is to clone the heavy chains 
as g3 fusions on phage and add purified light chains in 
vitro , as shown in fig. 5. These chains are partially 

10 denatured by the addition of Guanidine hydrochloride to 
5M final concentration, then the denaturant dialysed away 
so that heavy and light chains assemble to create 
functional antibody combining sites on the phage surface, 
which can then be selected on antigen and the appropriate 

15 heavy chain phage isolated. If necessary, the selection 
can be repeated with fresh light chain. Appropriate 
concentrations of other denaturants such as Urea or 
Potassium isothiocyanate will also prove effective. 
Having operated this procedure, one is left with a vastly 

20 reduced population of heavy chain genes which can then be 
cloned together with the light chain repertoire, 
preferably on the same replicon. The soluble chain can 
be produced by recombinant DNA technology, one or more 
monoclonal antibodies or from serum antibody. The 

25 reverse configuration, i.e. light chain on phage in 
conjunction with soluble heavy chain, or fragments of 
heavy chain, is also tenable. Also contemplated are 
alternative methods of linking heavy and light chains, 
which could be linked for example, by chemical 

30 modification. 

So far, the procedures described work on the 
principle of first reducing the complexity of the 
repertoire with possible subsequent recloning one or both 

35 chains of the reduced population. Alternative methods 
enabling both chains to be cloned on the same replicon 
with high efficiency have also been devised. These again 
rely on cloning heavy and light chain genes on separate 
replicons, but this time with the aim of promoting 

40 recombination between the two vectors so that both chains 
are placed on the same replicon. A schematic is shown in 
fig. 6 in which the recombination system is based on the 
lox P/Cre recombinase system of coliphage PI (Hoess, R.H. 
and Abremski, K. (1990) The Cre-lox recombination system. 

45 In 1 Nucleic acids and Molecular Biology' . Eckstein, F. 
and Lilley, D.M.J, eds. Vol 4, pp99-109, Springer- Verlag, 
Berlin, Heidelberg). Cre-recombinase catalyses a highly 
specific recombination event at sequences called lox, lox 
P, the recombination site in phage PI consists of two 

50 15bp inverted repeats separated by an 8bp non- symmetrical 
core (fig.6). In the configuration detailed in fig. 6 
soluble light chain is cloned onto a phagemid containing 
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a single lox P site. The heavy chains are cloned onto a 
plasmid as 3g fusions. Alongside the g3 fusion is the 
gene for a selectable marker, and the 
heavychain/g3/marker sequence flanked by two lox P sites. 
5 This plasmid also contains the Cre recombinase on a 
regulatable promoter and has an origin of double- stranded 
replication that is compatible with that on the phagemid 
in addition to that on the helper phage e.g. pl5A, RSF 
1010 and col El origins will co-exist in the same cell. 

10 The phagemids are then infected into cells containing the 
donor plasmic and the Cre recombinase promoter induced, 
so that recombination between the lox P sites occurs 
inside infected cells. Some of these recombination 
events will lead to the heavychain/g3 /marker sequences 

15 transferring as a block onto the phagemid at its single 
lox P site. Phagemids are then rescued with a helper 
phage such as M13K07 and the resulting phagemid particles 
either directly selected on antigen or infected into 
fresh host cells and grown with selection for the 

20 presence of both markers; one from the phagemid itself 
and the other from the heavychain/g3 /marker block. 

The use of site-specific recombination to bring 
genes onto the same replicon may be extended to creation 

25 of a continuous coding sequence on the same replicon, for 
example to construct single-chain Fv molecules. There is 
a single open reading frame in the loxP sequence that 
could be incorporated into an scFv linker which would 
then be a substrate for Cre-catalysed site-specific 

30 recombination. Placement of such modified scFv linker 
sequences at one or both ends of the genes to be fused 
can then result in creation of continuous open reading 
frames in vivo or in vitro when Cre recombinase is 
provided. 

35 

The strategy can be refined further if the Cre- 
catalysed recombination takes place in a polA strain of 
bacteria, preferably E.coli or other gram negative 
bacterium; these cells are deficient in DNA polymerase I 

40 and are unable to support replication of plasmids 
(Johnston, S. and Ray, D.S. 1984, supra). However, they 
are able to support replication of filamentous phage and 
plasmids containing filamentous phage intergenic regions. 
By selecting for the presence of both selectable markers 

45 in the same pol A cell, successful recombination events 
are enriched, since recombination must take place for the 
second marker gene to be replicated and expressed. The 
resulting cells are now the complete repertoire and can 
be propagated as cells and infected with helper phage to 

50 produce phagemids containing the genes for both chains 
and expressing them on their surface. 
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Other general and/or site-specific recombination 
mechanisms could also be used to effect the same outcome 
(In "Escherichia coli and Salmonella typhimurium. 
Cellular and Molecular Biology." (1987). ppl034-1043, 
5 1054-1070. Neidhart, F.C. Editor in Chief. American 
Society for Microbiology). 

It will be apparent that the concept of using two or 
more replicons to generate diversity is not confined to 

10 display on the surface of filamentous bacteriophages. 
For example, bacteria could be used as the replicable 
genetic display package. For example , Fuchs et al. have 
shown that functional antibody can be displayed on the 
surface of E.coli by fusion to peptidoglycan-associated 

15 lipoprotein (Fuchs, P., Breitling, F., Dubel, S., 
Seehaus, T and Little, M. (1991) Targetting of 
recombinant antibodies to the surface of Escherichia 
coli: fusion to a peptidoglycan associated lipoprotein. 
BioTechnology 9, 1369-1373). Klauser et al. describe 

20 transport of a heterologous protein to the surface of 
E.coli by fusion to Neisseria IgA protease (Klauser, T., 
Pohler, J. and Meyer, T.F. (1990) Extracellular transport 
of cholera toxin B subunit using Neisseria IgA protease B 
domain: conformation-dependent outer membrane 

25 translocation. EMBO 9, 1991-1999). Other surface 
proteins such as pili, ompA or the surface-exposed 
lipoprotein Tra T could also be used, and gram positive 
organisms such as lactobacilli and streptococci employed. 
Cloning and expression in Eukaryotic organisms is also 

30 contemplated. 

Alternative cloning strategies are possible when 
cells are used in place of phage. For example, replicons 
can be introduced into the cells by conjugation, in 
35 addition to transformation and infection. Moreover, one 
or more genes can be incorporated into the chromosome 
reducing the limitation of having to use compatible 
replicons. 

40 The polycombinatorial concept is also particularly 

advantageous for mutagenesis experiments by allowing far 
greater numbers of mutant progeny to be produced. 

The applicants have realised that the 
45 polycombinatorial concept is applicable to multimeric 
proteins other than antibodies , such as T cell receptors, 
CD3 and insulin receptor. Libraries of proteins having 
more than two different and diverse subunits can be 
created by, for example, more than one cycle of 
50 infection. Cells containing one of the subunits are 
infected with phage containing the second subunit and the 
resulting population infected a second time with a 
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compatible phage carrying the third subunit. 

In some cases, it is advantageous to express two or 
more component polypeptide domainns of a multimer as g3 
5 fusions. This will have the benefit of stabilising weak 
interactions between separate chains, or stabilising 
polypeptide domains which interact weakly, or polypeptide 
domains which only associate in the presence of ligand. 

10 The numbers of combinations possible with the 

polycombinatorial approach is limited only by the number 
of clones present in each of the repertoires, and, in the 
specific instance of using phage supplying one chain to 
infect cells containing the other, by the numbers of 

15 phage and cells that can be produced. The use of more 
sophisticated methods, for example fermentation 
technology, will allow even greater numbers of 
combinations to be accessed. 

20 PCT/W091/17271 filed by Affymax Inc. describes the 

expression of antibody heavy and light chains but does 
not indicate that infection of cells harbouring one 
replicon with phage harbouring another replicon allows 
libraries of greater size to be constructed. Neither 

25 does it describe the selection process for selecting an 
antibody fragment of interest using the two replicons. 
They only contemplate double selection to maintaxn the 
two chains together in the same phage or bacterium, 
which, in the format they describe, will limit the size 

30 of the library which can be constructed. There is no 
indication that with heavy and light chain libraries on 
separate vectors, the heavy ande light chains would get 
reshuffled, or how to identify the desired combination. 

A key difference over previous approaches is the use 
of separate sources of heavy and light chains and the way 
in which they are combined to produce libraries of 
greater diversity. The applicants provide methods for 
the construction of such libraries and teach how heavy 
and light chain libraries may be combined to produce 
enormous numbers of functional combinations, and means by 
which desired combinations may be selected and isolated. 
The key advantage is that libraries constructed this way 
can be several orders of magnitude larger than has 
45 previously been possible. Where two replicons are used 
they can be any pairwise combination of phage, phagemid 
and plasmid, in all cases with the antibody chains 
expressed as soluble fragments or associated with the 
phage capsid. At least one of the vectors is capable of 
being incorporated into an infectious phage-like particle 
and at least one of the vectors enables association of 
the antibody chain with the phage capsid, for example by 
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fusion to g3p. Any of the above configurations can be 
used in novel humanisation/mutagenesis procedures. 

The "chain-shuffling" combinatorial approach is a 
5 particularly useful embodiment of the present invention. 
One may take for instance, a single heavy chain, or a 
restricted number of heavy chains from an antibody known 
to have the desired antigen specificity of even from a 
repertoire of antibodies from human or animal immunised 

10 with an antigen of interest, and combine a population of 
such chains with a, perhaps very large, genetically 
diverse population of, in this instance, light chains 
fused to a rgdp component. The light chains would be 
expressed from nucleic acid capable of being packaged in 

15 a rgdp. One would then select for rgdps which each a 
display light chain with an associated heavy chain 
forming an antibody specific for an antigen of interest. 
Such rgdps would each contain nucleic acid encoding a 
light chain. Light chains of the restricted population 

20 so selected would then be combined with a genetically 
diverse population, perhaps a very large population, of 
heavy chains fused to a rgdp component and expressed from 
nucleic acid capable of being packaged in rgdps. A 
second round of selection for rgdps displaying specific 

25 binding pair members specific for the antigen of interest 
would yield a restricted population of heavy chains 
capable of associating with the previously selected light 
chains to form antibodies of the desired specificity. 

30 This technique enables reduction of population 

diversity to an easily manageable level whilst sampling a 
very large number of combinations. Nothing in the prior 
art approaches this. It may be used advantageously in 
the humanisation of antibodies isolated/purified from a 

35 non-human animal source. 

The following examples illustrate how these concepts 
may be put into practice. It will be evident to those 
skilled in the art that many variations on these themes 
40 will produce satisfactory results. The following 
examplify the concept by way of illustration only and not 
by way of limitation. 
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Example 1: Rescue of phagemid with phage fd-DOG-1. 

in this example, the concept of using a phage to 
•rescue' a phagemid (fig 2) is tested using model 
chimaeric antibody. 

One chain is expressed in pUC19,pUC119 or pHBN-1 as 
a soluble periplasmic protein and the corresponding chain 
cloned on fd-DOG-1 as a g3 fusion. Both chains have come 
fcSTlSdSa ™ fragment cloned in pUC19, in which the 
SSt and I heavy V-region domains from the mouse ant i-phOx 
(2?pnen?I-5-oxIzalonS) antibody NQ10.12.5 have been fused 
to huma£ C and C 1 domains respectively. The C-terminal 
cysteine residues, which normally form a covalent link 
15 Slwetn the light and heavy chains, have been deleted 
rrom TotlTconstant domains in this construct, and this 
feature is retained in subsequent constructs. The 
sequence of the template clone is shown xn fig. 7. 

20 The strategy for cloning heavy and light chains as 

separate^ elemenS is depicted in fig. 8; briefly, the 
chains were separately PGR amplified with primers that 
incorporate appropriate restriction sites onto the ends 
of the fragments. These fragments were then cloned into 

25 pHEN-1 and fd-DOG-1 in both configurations i.e. heavy 
chain on phage, light chain on phagemid and vice versa. 
Thl heavy chain Sfi I-Not I fragments were also cloned 
into pUC19 and pUC119 derivatives which have had the 
polylinker between the Eco RI and Hind III sites replaced 

30 with the sequence shown in fig 9, which contains 
compatible Sfi I and Not I sites. These clones are 
called pUC19/pUC119 Sfi-Not polymyc. The pHEN-1 clones 
Sere transformed into E.coli strain HB2151 which is a 
male, lad and a non-suppressor, causing the amber codon 

35 in pHEN-1 to be read as a stop codon, thereby producing 
soluble chain exported to the periplasm. The remainder 
of the constructs were transformed into E.coli TGI. 

These cells are then 'rescued' with fd-DOG-1 phage 
40 carrying the partner chain as a g3 fusion oresul- ting 
phage/phagemid population assayed for phOx binding in 
ELISA. 

In this example, sections a) to g) describe 
45 preparation of plasmid, phagemid and ophage clones; 
sections i) and j) show the effect of using phage to 
resSe phagemid or plasmid, and the effect that has on 
antibody expression. 

50 a) PCR amplification of heavy and light chains 

Plasmid pUC19 Fab NQ10.12.5DNA was used as the 
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template for PCR amplification. Four separate PCR 
reactions were set up, with the following pairwise 
combinations of primers (TABLE 1): 

5 Heavy chain fd-DOG-1 phage: VH1BACKAPA & FABNOTFOH 

Heavy chain pHEN-1 phagemid: VH1BACKSFI & FABNOTFOH 

Light chain fd-DOG-1 phage: MVKBAAPA & FABNOTFOK 

Light chain pHEN-1 phagemid: MVKBASFI & FABNOTFOK 

10 PCR reactions contained lOmM Tris-HCl(pH8.3), 50mM 

KCI, 1.25mM each dNTP, 2.5mM Mg C12, 0.01% gelatin, 0.1 
unit/|il Taq polymerase ( Cetus/Perkin Elmer), ImM each 
primer and lng of template DNA. PCR was carried out in a 
Techne PHC-2 thermal cycler (Techne, Duxford, Cambridge 

15 U.K. ) using 25 cyles of 1 minute at 94°C, 1 minute at 
50 °C and 2 minutes at 72 P C. 

b) Digestion of PCR fragments 

20 The resultant products were extracted with 1:1 

phenol: chloroform then ethanol precipitated as described 
in Sambrook et al. (Sambrook, J., Fritsch, E.F. and 
Maniatis, T. (1990) "Molecular cloning-a laboratory 
manual". Cold Spring Harbor Laboratory, New York.), and 

25 pelleted DNA redissolved in 35pl water. Restriction 
digest were normally carried out in 100- 200^1 volumes 
with 0.3-0.4 units enzyme/pl (volume of enzyme added not 
to exceed l/20th of the total reaction volume) using 
conditions recommended by the manufacturer, and in the 

30 buffer supplied by the manufacturer. Digestion with 0.4 
units/pl Not I enzyme (new England BioLabs) was carried 
out in 150yl volume according to manufacturers 
instructions and in the buffer provided by the 
manufacturer for 3hrs at 37 °C. The products were 

35 phenol: chloroform extracted and ethanol precipitated once 
more and digested either with ApaLI or Sfi I. Apart from 
the Apa LI digest of the VHCH1 Apa LI -Not I fragment (see 
below), digestions were carried out using 0.4 units/jil 
enzyme in a total of ISOpl according to manufacturers 

40 instructions (New England BioLabs) and in the buffer 
provided by the manufacturer. Apa LI digests were 
carried out for 3 hrs at 37°C, Sfi I for 3 hrs at 50°C. 

The Apa LI digest of the VHCH1 Apa LI -Not I fragment 
45 had to be a partial due to the presence of an internal 
Apa LI site. This was achieved by digestion with 0.04 
units/yl Apa LI for 1 hour and the full-length fragments 
isolated from an agarose gel ( see below ) . 

50 c) Purification of DNA fragments 

Reaction products were ethanol precipitated to 
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reduce their volume then run on a Preparative 2% Low 
Selting Point agarose/TAE (Tris-Acetate EDTA) gel, the 
cat 700bp bands excised and the DNA fragments Purxfxed 
using a P geneclean kit in accordance with ^ the 
5 manufacturers instructions (Bio 101, La Jolla, 
^fcrnS, USA) . DNA fragments were resuspended in TE 
(!omM Tris-HCl (pH 8.0), O.lmM EDTA) and legated to 
prepared vector (sections d and e). 

10 d) Preparation of vector DNA 

lOug Caesium chloride-purified fd-DOG-1 was digested 
with Ana LI and Not I as above. lOug Caesxum chloride- 
puSffe? PUC19 Sfi/Not, PUC119 Sfi/Not and pHEN-1 DNAs 

15 were digested with Sfi I and Not I, using the same 
Editions as those described in b) above Following 
digestion and ethanol precipitation, vector DNA was 
phosphatased with . 1 unit Calf Intestinal Alkaline 
Phosphatase in 50ul of the buffer recommended and 

20 supplied by the manufacturer (Boehringer Manheim UK Ltd., 
Bell Lane, Lewes, East Sussex, BN7 1LG) as a lOx stock, 
for 30 minutes at 37'C, then another 1 unit of enzyme 
added and the incubation repeated. Forty ul of w^er 
lOul of lOx STE (lOx STE is lOOmM Tris-HCl, pH (8.0) 1M 

25 NaCl, 10mM EDTA) and 5ul 10% SDS was then added and the 
mixture incubated at 68'C for 20 minutes to inactivate 
the phosphatase. The mixture was then cooled on ice 
SLfly Ind extracted twice with 1:1 phenol :chloroform 
then ethanol precipitated as described in Sambrook et al. 

30 (1989, supra). 

e) Ligations 

The following ligations were set up: 

PUC19 Sfi I-Not I +VHCH1 Sfi I-Not I 

PUC119 Sfi I-Not I +VHCH1 Sfi I-Not I 

pHEN Sfi I-Not I +VHCH1 Sfi I-Not I 

pHEN Sfi I-Not I +VLCL Sfi I-Not I 

40 fd-DOG Apa LI-Not I +VHCH1 Apa LI -Not I (P^^J tJ) 

fd-DOG Apa LI-Not I + VLCL Apa LI -Not I 

For each, the following ligation reaction was set 



35 



45 up: 



10 x NEB-Ligation Buffer 1 ul 

K 111 



6 ^1 
2 ul 
1 ^1 

Spin for a few seconds in the microfuge. Then add: 



* water 

* Digested vector (30 ng/ul) 2 ul 
50 * Digested PCR fragment (20-50 ng/ul) l Ul 
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* T4-DNA ligase (400 units/pl, NEB) 1 pi 
3- Leave for 2 hrs at 25 °C or overnight at 16 °C. 
4. Transform into E.coli (see below). 

Note ; 10 x NEB-Ligation Buffer is 0.5M Tris-HCl, pH 

7.8, 0.1M MgCl2, 0.2M DTT, lOmM rATP and 500 
jig/ml BSA. 

f ) Electroporation into TGI or HB2151 



1. Thaw a vial of electroporation-competent bacteria on 
ice. For soluble expression of antibody fragments 

15 from pHENl , the non-suppressor strain HB2151 is 

used. For the others, TGI is used. 

2. Transfer 50pl cells to a prechilled 0.2cm cuvette 
(Biorad), add 2 yl ligation mix, shake to the bottom 

20 and sit on ice for 1 min. 

3. Set up the Gene Pulser (Biorad) to give 25 pF, 2.5kV 
with the pulse controller set to 200 ohms. 

25 4. Dry the cuvette with tissue and place in the 
electroporation chamber. 



5. Pulse once (should yield a pulse with a time 
constant of 4.5 to 5 msec). 

6. Immediately add 1 ml of SOC (fresh) to the cuvette 
and resuspend the cells. 



7. Transfer to disposable culture tube, and shake for 1 
35 hr at 37°C. 

8. Plate fractions on 2YT agar plates containing 100 
}ig/ml ampicillin, 1% glucose for pHEN and pUC 
replicons or 2YT agar plates containing 15pg/ml 

40 tetracyclin for fd-DOG. 

Note 1 : SOB is 20g Bacto-tryptone, 5g Yeast extract and 
~ ^ 0.5g NaCI, in 1 litre. 

SOC is SOB with 5ml 20% glucose, 1ml 1M MgCl2 
45 and 1ml 1M MgS0 4 added per 100ml. 

2YT is 20g Bacto-tryptone, lOg Yeast extract 

and 5g NaCI, in 1 litre. 

Note 2 : To increase transformation efficiencies the DNA 
50 — in the ligation mix can be purified by 

extracting with phenol, phenol-chloroform and 
ether, ethanol precipitating and resuspending 
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in water. Alternatively the samples can be 
cleaned up using geneclean (Bio 101). 
Efficiencies will go up 10-100 fold xf this 
purification step is included. 

The desired clones were screened by PCR with the 
primers used to clone the fragments and their identity 
confirmed by DNA sequence analysis. Infectious particles 
were then produced from these clones (see below). 



g) 



Preparation of fd-DOG infectious phage particles 



1 inoculate colony of bacteria containing fdDOG into 
2YT broth containing 15 ug/ml tetracycline. Grow 
15 37 °C, shaking for 20-24hrs. The yield of phage 

particles should be about 10 10 TU (transducing 
units-see below) ml" 1 of supernatant. 

2. Spin 8,000 r.p.m. for 10 min (or 4,000 r.p.m. for 20 
20 min ) . 

3. To supernatant add l/5th volume PEG/NaCl (20% PEG 
6000, 2.5M NaCl), leave 1 hr at 4°C. Spin 8,000 
r.p.m. for 15 min (or 4,000 r.p.m. for 30 mm). 

4. Resuspend pellet in a small volume of water and 
transfer to eppendorf tubes. 

5. Spin down any remaining cells (5 min in microfuge). 

5. To the supernatant add l/5th volume of PEG/NaCl . 

6. Remove supernatant and respin the pellet briefly. 
35 7. Aspirate off any remaining PEG. 

8. Resuspend the pellet in water (l/100th original 
volume of culture) and respin 2 min in microfuge to 
remove residual bacteria and agglutinated phage. 

40 Filter through 0.45um sterile filter (Minisart NML; 

Sartorius ) . 

9. Store the filtrate at 4»C. The concentration of 
phage should be about 10 1Z TU ml" A . 

h) Infection experiments 

Preliminary experiments using fd-DOG phage to infect 
pHEN-1 suggested that there is interference between the 
two vectors because both carry a filamentous phage 
intergenic region, causing competition for the same 
replication machinery. The experiments below confirm 
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this* 

TGI cells, either untrans formed or harbouring pUC19, 
PUC119, pUC19 NQ10.12.5 chimaeric VHCH1 and pUC119 
5 NQ10.12.5 chimaeric VHCH1 were grown to mid-log phase at 
37 °C in 2YT medium, without antiboitic for TGI or 
containing lOOpg/ml ampicillin and 1% glucose for the 
recombinants. Cultures were then diluted to contain ca. 
10 8 cells/ml using the approximation O.D. 60 o 1*0=5.10 8 

10 colony forming units (cfu)/ml, and fd-DOG phage added to 
3.10 9 tetracycline resistance transducing units (TU, 
assayed on TGI) per ml. After incubation at 37 °C for 30 
minutes, dilutions were plated on 2YT agar containing 
15jig/ml tetracyclin and 1% glucose and incubated at 37 °C 

15 overnight to give the number of successful infection 
events (^number of tet r colonies). The result is shown 
graphically in fig. 10. There is little difference in 
titre when fd-DOG phage infect TGI cells or TGI cells 
containing pUC19- the titre is equal to the number of 

20 input cells (since in this experiment phage were in 
excess). However, when the host cell contains pUC119 (or 
any vector based on pUC119) the number of healthy tet r 
colonies (2-3mm diameter) is reduced ca. 100-fold, and 
there are now numerous tiny colonies (0.3-0. 5mm 

25 diameter). m These small colonies do not grow in liquid 
culture with tetracycline, and, as the number of 
small+large colonies=input number of cells it is believed 
that they are infection events in which the phage has not 
established. 

30 

The presence of an antibody heavy chain in pUC19 or 
pUC119 has little effect on yield of tet r colonies after 
infection with fd-DOG. Similar interference is observed 
when M13 is substituted for fd-DOG and the number of 
35 successful infection events assayed by plaque numbers 
(fig. 10). The ability to recover functional antibody on 
the surface of phage was then tested - sections i and j 
below. 

40 i) Rescue of Fab phage 

The following rescues were set up; all chains are 
NQ10.12.5 chimaeras: 

45 pUC19 Heavy chain X fd-DOG- Light chain 

pUC119 Heavy chain X fd-DOG- Light chain 
pHEN Heavy chain X fd-DOG-Light chain 
pHEN Light chain X fd-DOG-Heavy chain 
pHEN Light chain X fd-DOG-Light chain 

50 pHEN Heavy chain X fd-DOG-Heavy chain 

Host cells (HB2151 for pHEN-1, TGI for pUC ) 
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containing plasmid/phagemid with one of the two Fab 
chains were grown overnight at 37"C in 2YT containing 
lOOug/ml ampicillin and 1% glucose (2YTAG). Glucose is 
used to repress the lac promoter and thereby reduce 
5 expression of the antibody gene. The stationary cultures 
were diluted 1:100 into lOmls fresh 2YTAG xn a 50ml 
polypropylene tube (Falcon) and grown at 37 'C to an 
O.D.finn of 0.5 before adding concentrated f d-DOG phage 
containing either the heavy or light chain genes as g3 

10 fusions to 1.10 9 TU/ml final concentration. These 
cultures were then incubated at 37 °C without shaking for 
30 mins then with rapid shaking for another 30 mins. 
Cells were then pelleted by centrifugation at 4,000x g 
for 20 mins at 4'C, the tubes drained and the cells 

15 resuspended in lOmls fresh 2YT containing lOOug/ml 
tetracyclin (no glucose -> induction) and grown with 
vigorous shaking overnight at 37 °C. 

The next day, cells were pelleted by centrifugation 
20 at 4 OOOx g for 20 mins at 4°C, and the supernatant 
assayed for the presence of functional antibody by ELISA. 

j ) ELISA 

25 1. Coat plate (Falcon 3912) with lOOul of PhOX-BSA 

(14:1 "substitution) per well at 10 ug/ml, in PBS. 
Leave overnight at room temp. 

2. Rinse wells 3x with PBS, and block with 200 ul per 
30 well of 2% Marvel/PBS, for 2 hrs at 37°C. 

3. Rinse wells 3x with PBS, then add 25 pi 10% 
Marvel/PBS to all wells. 

35 4. Add lOOul culture supernatant to the appropriate 
wells. Mix, leave 2 hrs room temp. 

5 Wash out wells 3 times with PBS, 0.05% Tween 20 and 
3 times with PBS. Add lOOul sheep anti-M13 
40 antiserum diluted 1:1000 in 2% Marvel/PBS into each 

well. Incubate at room temp, for 1.5 hrs. 

6. Wash out wells 3 times with PBS, 0.05% Tween 20 and 
3 times with PBS. Pipette lOOul of 1:5000 dilution 

45 of anti-sheep IgG antibody (peroxidase-conjugated, 

Sigma). Incubate at room temp, for 1.5hrs. 

7. Discard 2nd antibody, and wash wells 3 times with 
PBS, 0.05% Tween 20 and 3 times with PBS. 

3U 8 Add one lOmg ABTS (2,2'-azino bis(3- 
ethylbenzthiazoline-6-sulphonic acid), diammonium 
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salt) tablet to 20ml 50 mM citrate buffer, pH4.5. 
(50 mM citrate buffer, pH4.5 is made by mixing equal 
volumes 50 mM trisodium citrate and 50 mM citric 
acid ) . 

5 

9. Add 20pl 30% hydrogen peroxide to the above solution 
immediately before dispensing, 

10. Add lOOyl of the above solution to each well. Leave 
10 room temp. 30 rains. 

11. Quench by adding 50pl 3.2mg/ml sodium fluoride. 
Read at 405mm. 

15 Note 1 : 'Marvel' is dried milk powder. PBS is 5.84g 
NaCl , 4 . 72g Na 2 HP0 4 and 2 . 64g NaH 2 P0 4 . 2H 2 0 , 
pH7.2, in 1 litre. 

The result is shown in figure 11, where it can be 

20 seen that whether the resident replicon is a plasmid or a 
phagemid has a significant effect on the amount of 
antibody rescued by fd-DOG carrying the complementary 
chain. The signal generated from cells carrying the 
heavy chain on pUC19 is ca. lOx that from cells carrying 

25 this chain on pUC119 when rescued with fd-DOG carrying 
the light chain. pHEN gives similar results to pUC119, 
when the heavy chain is expressed in pHEN and the light 
chain on fd-DOG, and vice versa. Although weak, the 
signal is specific since no antibody is produced when the 

30 incoming phage carries the same chain as that on pHEN. 
Clearly, the use of phage carrying one chain to rescue 
phagemid carrying the complementary chain does work, 
though the system would benefit from manipulation of 
phage or phagemid replicons to alleviate interference. 

35 An alternative is to use a plasmid vector in conjunction 
with phage which will result in selection of just one of 
the two chains. Such a strategy is used in example 2. 
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Example 2: Polycombinatorial Libraries 

In this example a human VH repertoire is cloned into 
fd-DOG carrying the human CHI domain of IgGl (Cyl 
5 domain). The repertoire of light chaxns is cloned xnto 
PUC?9 Sfi/Not/polymyc plasmid as Sfi/Not fragments and 
Separately into pHEN phagemid flanked by an Sfx sxte at 
one end and Asc and Not sites at the other. 

10 Fd-DOG heavy chain phage are used to 'rescue' the 

PUC19 light chaxns and the resulting phage selected on 
Ltigen. The heavy chain population is reduced xn this 
way then PCR-amplified with primers incorporating Asc I 
and Not I sites and these fragments cloned alongsxde the 

15 unselected light chains in pHEN phagemid The genes for 
both chains are now on the same replxcon and can be 
selected simultaneously with antigen after rescue with 
helper phage. 

20 The VH domains used in this example derive from IgM 

but are cloned as IgGl fd (i.e. VHCH1) fragments wxthout 
the C-terminal cysteine. 

Sections a and b below describe construction of an 
25 fd-DOG derivative containing a human C T 1 domain from 
whicTthe natural Apa LI site has been dieted. Sections 
e to g cover preparation of the heavy and light chaxn 
repertoires. Section f describes how these repertoxres 
may be used to isolate specific antibody fragments. 

30 . _ __, 

a) Construction of A Apa LI versxon of CHI 

Construction of a A Apa LI version of CHI is 
described here. The construction is somewhat involved 
35 since the primary purpose was construction i of a new 
phagemid cloning vector, and removal of the Apa LI site 
in CHI only one step in the procedure. The new phagemid 
containing the A Apa LI version of CHI was used as 
template in the next section. 

The naturally-occurring Apa LI site in the human C^l 
domain was deleted by 'PCR mutagenesis' using partxally 
complementary oligonucleotides that overlap around the 
Apa LI site and change this sequence from CTGCAC to 
45 GTGGAC. Two PCR reactions were performed and the two 
fragments joined in a second PCR-splicing by overlap 
extension. 

Two PCR reactions were set up as in example 1 using 
50 RJHXHOBACK in conjuction with APAOUTFOR and APAOUTBACK 
in conjuction with HUGICYSASCNOTFOR (Table 1). Template 
was pUC19 Fab D1.3 - this anti-lysozyme Fab has the same 
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human IgG CHI domain as pUC19 Fab NQ10.12.5 described in 
example 1. PCR conditions were as described in Example 
1, except that the annealing temperature was 55 °C and 
extension was at 72 °C for 1 minute. The resulting 
5 fragments were purified for assembly as follows: 

1. To isolate small fragments for assembly-PCR, the 
PCR-mixes are collected in one tube and 
electrophoresed on a 2% LGT (low gelling 

10 temperature) agarose gel with TBE buffer. 

2. These fragments are purified using SPIN-X columns 
( Costar ) . 

15 3. The gel slice is loaded into the cartridge of a 
SPIN-X column and frozen in dry ice for 10-15 min. 

4 . Thaw tube and repeat freezing step ( optional ) . 

20 5. Spin in a microfuge (appr. 13,000 r.p.m. ) for 15-30 
min. 



25 



35 



6. Precipitate the filtrate by adding 1/10 vol. 3M 
sodium acetate, pH5.2, and 2.5 vol. ethanol. 

7. Chill on dry ice for 15 min, spin at 13,000 r.p.m. 
for 10 min at 4°C. 



8. Wash the pellet in 1 ml 70% ethanol and dry under 
30 vacuum. 

9. Dilute purified linker into 5 pi water or TE per 
original 50 ]il PCR, and measure concentration on 



gel. 



These fragments are then joined by PCR. A 50 jil PCR 
reaction is set up as before, this one containing 5 pi 
each fragment but no primers. The reaction is held at 
95°C for 5 rains, then cycled at 94*C 1 min, 68°C 1 min 

40 and 72 °C 1 min, seven times. RJHXHOBACK and 
HUGICYSASCNOTFOR flanking oligonucleotides were then 
added under the oil and the reaction cycled another 10 
times using the same conditions to amplify those 
molecules that have correctly assembled. The reaction 

45 product was phenol extracted and ethanol precipitated 
then digested with Xho I and Asc 1 and gel-purif ied, all 
under standard conditions. 

A second fragment containing the entire gill leader 
50 plus polylinker of fd-DOG was also amplified. Fd-DOG DNA 
was amplified with primers G3 LASCGTGBACK and fdseq, cut 
with Asc I and Not I and gel-purified, all as described 
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in example 1. 

A 3-way ligation was set up with equimolar 
quantities of Xho I-Not I -cut pHEN DNA, Xho I-Asc I -cut 
5 fragment and Asc I-Not I -cut gill ^f^j^S^SS. 

fraoment The resulting mixture was transformed into TGI 
as Ascribed in example 9 1 and clones with the structure: 
oHEN^pelB leader/Sfi I/Nco I/Xho I-C T 1-Asc I/glll Leader- 
Apa LI-Not I/gIII identified by alkaline lysis ".iniprep 
10 and restriction enzyme analysis (Sambrook et al. (1989) 
Sr£TT The integrity was verified by DNA sequencing 
Sambrook et al. (1989) supra.) and a representative 
clone called pJIM C T lgIIIL, which carries the C T 1 domain 
from which the internal Apa LI site has been removed. 



15 



45 



b) Insertion of CHI domain into fdDOG 



The A Apa LI version of CHI was now used as a 
template for construction of an f d-DOG ; derivative 

20 containing the CHI domain lacking the Apa LI site. In 
Sis construct the C-tenninal cysteine residue normally 
rormin? a disulphide bond with the light chain was 
Averted to serine by PCR with the FABNOTFOH primer. 
This vector is now suitable for cloning of the repertoire 

25 of VH domains as Apa Li-Sal I fragments. 

50ng of pJIM C T lgIIIL template DNA was PCR amplified 
in a 50ul reaction using the conditions described in 
Example £ 33? the primers FDGAM1BAAPA and FABNOTFOH 

30 which bring in Apa LI and Not I sites; the FABNOTFOH 
primer also removes the C-terminal cysteine. The ca. 
300bp PCR fragment was then processed and digested with 
Apa LI and Not I using conditions described in example 1. 
The Apa LI-Not I fragment was then cloned into the 

35 Preparation of Apa LI and Not I -cut fd-DOG ^A Prepared 
for the experiments described in example 1, and ligated 
and transformed into TGI using the same procedures. 

The sequence of the vector was verified by DNA 
40 sequencing (Sambrook J. et al, 1989, supra) of single- 
stranded DNA isolated from filamentous Particles 
(Sambrook J. et al, 1989, supra) Prepared in the usual 
way (example 1) using the primer fdseq (Table 1). The 
vector was then CsCl-purified (Sambrook J. et al, 1989 
supra) and cut with Apa LI and Sal I. Both enzymes were 
from New England BioLabs, and sequential digestions ustog 
0.4 units enzyme/ul performed at 37"C in the buffers 
provided by the manufacturer (see example 1). 

50 c) Preparation of cDNA template 

500ml of blood, containing approximately 10 8 B- 
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lymphocytes, was obtained from 2 healthy volunteers. The 
white cells were separated on Ficoll and RNA was prepared 
using a modified method (Cathala, G., J. Savouret, B. 
Mendez, B.L. Wesr, M. Karin, J. A. Martial and J.D. Baxter 
5 (1983). A method for isolation of intact, 
transcriptionally active ribonucleic acid. DNA 2, 329). 
Three first strand cDNA syntheses were made as described 
by Marks et al (1991, supra) from RNA corresponding to 
2.5 X 10 7 B-cells, using an IgM constant region primer 
10 for the heavy chains, and a tC or X constant region primer 
for light chains (Table 1). 

d) PCR of Heavy Chains 

15 Two preparations of PCR-amplif ied VH genes were 

made. Both preparations used an eguimolar mixture of the 
HUJHFOR primers (Table 1); in one of the preparations, 6 
separate PCR amplifications were performed with each of 
the HUVHBACK primers individually (Table 1), and in the 

20 other, a single PCR reaction was performed with an 
equimolar mix of all 6 HUVHBACK primers. For all seven 
PCRs, 50 \xl reaction mixtures were prepared containing 5 
]il of the supernatant from the cDNA synthesis using the 
HUIGMFOR primer, 20 pmol total concentration of the BACK 

25 primers, 20 pmol total concentration of the FORWARD 
primers, 250 pM dNTPs, lOmM KC1, lOmM (NH 4 ) 2 S04, 20mM 
Tris.HCl (pH 8.8), 2.0 mM MgC12, lOOmg/ral BSA and 1 pi (1 
units) Vent DNA polymerase (New England Biolabs). The 
reaction mixture was overlaid with mineral (paraffin) oil 

30 and subjected to 30 cycles of amplification using a 
Techne PHC-2 thermal cycler. The cycle was 94 °C for 1 
minute ( denaturation ) , 57 °C for 1 minute (annealing) and 
72 °C for 1 minute (extension). The products were 
purified on a 1.5% agarose gel, isolated from the gel by 

35 Geneclean (Bio-101) and resuspended in 25 \xl of H 2 0. The 
seven products were then pooled and ' pullthrough ' PCR 
reactions performed to attach Apa LI and Sal I 
restriction sites. 

40 Pullthrough reactions were set up with the primers 

HUVHBAAPA (equimolar mix of all 6 primers) and HUJHFORSAL 
(equimolar mix of all 4 primers). 50ml reactions of 
containing 5pl of the pooled PCR products from the 
previous step were amplified using the same conditions as 

45 for the primary PCR except that 25 cycles of 
amplification were used. The resulting fragments were 
digested with Apa LI and Sal I, gel-purified, and the 
fragments ligated to Apa LI and Sal I-cut fd-DOG CHI as 
previously described. The ligation mixes were phenol - 

50 chloroform extracted prior to electroporation into TGI 
cells (example 1). Aliquots of the transformed cells 
were plated on 2YT agar supplemented with 15 mg/ml 
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tetracyclin and grown overnight at 37 °C. 
e) PCR of Light Chains 

5 K and X-chain genes were amplified separately using 

an eguilmolar mixture of the appropriate family based 
BACK and FORWARD primers (Table 1). It -chain genes were 
amplified from the cDNA synthesis using HUCKFOR primer, 
using an equimolar mixture of the 6 HUVKBACK la-6a 

10 primers in conjunction with the HUCKFORSER primer. X- 
chain genes were amplified from the cDKA synthesis using 
the HUCLFOR primer, and amplified using an equimolar 
mixture of the 7 HULBACK 1-6 primers in conjunction with 
the HUCLFORSER primer. In each case 50ul reaction 

15 mixtures were prepared containing 5ul of the supernatant 
from the appropriate cDNA synthesis, 20pmol total 
concentration of the BACK primers, 20 pmol total 
concentration of the FORWARD primers, 250uM <»™5"! M 
KC1, 10mM (NH 4 ) 2 S0 4 , 20mM Tris.HCl (pH 8.8), 2.0mM MgC12, 

20 lOOmg/ml BSA ana lul (1 unit) Vent DNA polymerase (New 
England Biolabs). The reaction mixture was overlaid with 
mineral (paraffin) oil and subjected to 30 cycles of 
amplification using a Techne thermal cycler. The cycle 
was 94-C for 1 minute (denaturation) , 57°C for 1 minute 

25 (annealing) and 72'C for 2.5 minutes (extension). The 
products were purified on a 2% agarose gel, isolated from 
the gel by Geneclean (Bio-101) and resuspended in 25ul of 
H 2 0. 

30 Two different pullthrough reactions were now 

performed on each of the two light chain preparations . K. 
-chain genes were amplified in two reactions, using an 
eouimolar mixture of the 6 HUVKBASFI primers in 
Conjunction with either HUCKFORSERASCNOT or 

35 HUCKFORSERNOT. ^ -chain genes were also amplified in two 
reactions, using an equimolar mixture of the 7 HUVLBASFI 
primers in conjunction with either HUCLFORSERASCNOT or 
HUCLFORSERNOT. Pullthrough conditions were performed as 
for the primary light chain PCRs above except that 25 

40 cycles of amplification were used. All 4 PCR products 
were digested with Nco I and Not I using the same 
conditions as used previously (example 1 and above). 
Those K and X -chain genes amplified with the SERASCNOT 
foreward primers were inserted into Nco I-Not I-cut pHEN- 

45 1 vector (prepared using the standard format); those 
amplified using the SERNOT foreward primers were inserted 
into Nco I-Not I-cut pUC19 Sf i/Nco/Not/polymyc . Both 
repertoires were electroporated into TGI by the same 
methods as described in example 1: the ligation mixes 

50 were purified by phenol extraction and ethanol 
precipitated. The ligated DNA was resuspended in 10ul of 
water and 2.5ul samples were electroporated into 50ul 
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E.coli TGI . Cells were grown in 1ml SOC for 1 hr and 
then plated on 2YT agar with lOOpg/ml ampicillin and 1% 
glucose (2YTAG) in 243 x 243 mm dishes (Nunc). Colonies 
were scraped off the plates into 10ml 2YTAG and 15% 
5 glycerol for storage at -70 °C as library stocks. 

f ) Selection of Antibody Fragments 

The end result of sections a) to e) is the 
10 construction of two light chain libraries and one heavy 
chain library. The light chain libraries (VLCL) were 
cloned as Nco I -Not I fragments in both pHEN-1 and in 
pUC19 Sfi-Not polymyc. The VH genes were cloned as Apa 
Li-Sal I fragments in fd-DOG containing a human CHI (Cyl) 
15 domain from which the natural Apa LI site has been 
deleted (fd-DOG G1CH1). 

The process is now illustrated schematically in 
figure 13. The pUC19 heavy chain library is infected 

20 with the heavy chain fd-DOG library and fd phage bearing 
heavy and light chain combinations on their surface 
produced. These phage are selected on antigen (with 
rounds of reinfection of the light chain library and 
reselection an option) and the selected heavy chain genes 

25 amplified from the phage genome using PCR with primer 
G3LASCGTGBACK and fdseq. 

The G3LASCGTGBACK primer anneals to fd DNA upstream 
of the start of the natural g3 signal sequence and brings 

30 in an unique Asc I site and a ribosome binding site 
(RBS). Fdseq anneals at the 5' end of the structural 
part of g3 (i.e. downstream of the signal sequence 
cleavage site) which is downstream of the Not I site 
flanking the Cyl domain. The PCR fragments therefore 

35 contain: Asc I-RBS-g3 leader- VHCH1 -Not I. When these 
fragments are cloned into the pHEN-1 light chain library 
as Asc I -Not I fragments, the heavy chain is fused to g3 
of pHEN-1 and the Not I site is flanked by the myc tag 
and an amber codon. 

40 

Phagemid particles produced from these recombinants 
in a sup E strain not only bear heavy and light chains on 
their surface, they also contain the genes encoding both 
heavy and light chain. Selection on antigen one or more 

45 times will then result in isolation of functional heavy 
and light chain combinations. These clones can then be 
screened for antigen binding as soluble fragments when 
the phagemids are transferred into a non-suppressor 
strain such as HB2151. An efficient selection process 

50 would be particularly advantageous in this example. 
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Example 3: Soluble Combinatorial Libraries 

In this example either the light chain or the fd 
fragment of the heavy chain (i.e. VHCH1) is expressed on 
5 phage particles and the complementary chain provided as a 
soluble fragment. The mixture is partially denatured 
such that the chains separate but the domains don't 
unfold. The denaturant is then slowly removed by 
dialysis, allowing light and heavy chains to pack 
10 together so that they can be selected on antigen. The 
effect of this procedure is that the population of one of 
the two chains (whichever is attached to the phage) is 
greatly reduced and can then be cloned together with the 
repertoire of partner chains by conventional techniques. 

lj This approach is theoretically tenable: 1.10 11 TU of 

phage provide ca. 10 12 heavy chains if one assumes 2-3 
molecules of heavy chain per phage and then assays for TU 
produce a 10-fold underestimate of the absolute number of 

20 phage. One ug of a 25Kd light chain is 2.4.10- 1 " 3 
molecules- a 10-fold molar excess. 

Clearly, many variations on this theme are possible 
once the principle behind it is shown to be correct. For 

25 example the positions of heavy and light chains could be 
reversed and the chains themselves could derive from 
several different sources. For example human fd heavy 
chains on phage could be mixed with light chains from a 
soluble light chain repertoire expressed in E.coli. 

30 Alternatively, the light chains could be purified from 
human serum. 

The following example demonstrates not only that the 
principle is valid, but also that the process is 
35 surprisingly efficient. 

a) Preparation of Heavy Chain Phage 

VHCH1 NQ10.12.5 in fd-DOG was that constructed in 
40 example 1. VHCH1 anti-TNF in fd-DOG was used as a 
negative control. This clone was constructed from a 
murine anti-Tumour Necrosis Factor monoclonal antibody. 
The VH gene of this antibody was PCR amplified and linked 
in frame, by PCR, to the human CHI (C T I) domain from 
45 which the natural Apa LI site has been deleted (see 
example 2) and cloned as Apa LI-Not I fragments in 
fd-DOG. These phage were grown and concentrated as 
described in example 1. 

50 b) Preparation of soluble heavy and light chain 
NQ10.12.5 
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These were prepared from the parental mouse 
monoclonal antibody NQ10.12. 5. Culture supernatant was 
purified on protein A sepharose and concentrated to 
20mg/ml by Araicon ultrafiltration. This was then diluted 
5 with an equal volume of 1M Tris pH8.0 and l/10th volume 
of 1M 2-mercaptoethanol added; this amount of 
mercaptoethanol is sufficient to reduce the interchain 
but not the intra-chain disulphide bonds. After 1 hour 
at room temperature the free thiol groups were alkaylated 
10 by addition of l/10th volume 1.5M iodoacetamide and 
incubation on ice for 1 hour. 

The separated chains were purified by gel -filtration 
on a Zorbax Biosieves GF250 TSK column (DuPont) connected 
15 to an HPLC (Gilson). Two hundred and fifty pi aliquots 
of the reduced mixture were run in 5M Guanidine HCl/20mM 
Sodium phosphate , pH8.0 under the following conditions: 

Flow rate ■ 0.5mls/min; Fraction volume = 0.25mls; chart 
20 speed = lOmm/min; detector range = 2.0 (280nm); pressure 
= up to lOOOpsi; run time « 30 minutes. 

Peak fractions were analysed by SDS-PAGE and those 
fractions with pure heavy and light chain used in mixing 
25 experiments. 

c) Recombination of Chains 

The following mixes were set up: 

30 

NQ10.12.5 Heavy chain phage + Soluble NQ10.12.5 Heavy 
chain 

NQ10.12.5 Heavy chain phage + Soluble NQ10.12.5 Light 
35 chain 

NQ10.12.5 Heavy chain phage only 

anti-TNF Heavy chain phage + Soluble NQ10.12.5 Heavy 
40 chain 

anti-TNF Heavy chain phage + Soluble NQ10.12.5 Light 
chain 

45 anti-TNF Heavy chain phage only 

In each case, 1.10" T.U. of phage were mixed with 
25|ig r lOjig, 5pg or Ipg of purified soluble light chain 
and made to 5M Guanidine HCl/20mM Sodium phosphate, pH 
50 8.0 in 900yl final volume. These samples were then 
placed in a Pierce Microdialyzer System 500 sealed with 
Visking tubing (No. 2, Medicell Ltd., London Nl 1LX, 
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England). Samples were dialysed against 3 changes of 
lOOmM Tris-HCl, pH7.4 at 4«C for 48 hours and used in 
ELISA. 

The above treatment (particularly the 5M Guanidine 
HC1 step) appears to have little or no effect on the 
structure of the phage particle, at least with regard to 
infectivity, since there is no drop in T.U. of the sample 
over the course of the experiment. 

d) ELISA showing functionality of refolded antibodies 

The efficiency of refolding was assayed by ELISA 
against phOX-BSA, which detects correctly refolded 

15 antibody. This ELISA was performed in the same was as 
described in example 1. The key observation xs that 
functional antibody is indeed recovered when heavy chain 
NQ10.12.5 phage are mixed with purified NQ10.12.5 light 
Slain. in fact, there is little difference in signal 

20 obtained over the range of light chain c oncentrations 
down to lug, even when those phage are diluted; for the 
sake of clarity, just the lug results are shown in 
fig 12 It is evident from this that the stimulation is 
absolutely specific: stimulation is only seen when heavy 

25 chain anti-OX phage are mixed with anti-OX light chain- 
no stimulation is seen when heavy chain anti-OX pn f2 e 
alone or with heavy chain anti-OX phage mixed with 
soluble anti-OX heavy chain. Neither is any stimulation 
seen when anti-TNF heavy chain phage are used alone or 

30 have been mixed with anti-OX heavy or light chain. Only 
when the correct heavy and light combination is used is a 
functional antibody produced. 

This experiment demonstrates that the process works 
35 surprisingly well. The light chains could derive from 
other sources such as from a library constructed in 
E.coli or from serum antibody. The process will also 
work if heavy and light chains are reversed with respect 
to the above, i.e. light chain on phage and soluble heavy 
40 chain. 
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Example 4: Humanising rodent antibodies using 
polycombinatorial libraries and CDR 
imprinting 

5 CDR3 of the heavy chain is generally found to be the 

most variable of all CDRs in terms of both length and 
sequence, and can make important contacts with antigen 
( Winter , G. and Milstein C. Man-made Antibodies. (1991) 
Nature 349 , 293-299). This is an important consideration 

10 when humanising, which can be by CDR-grafting or chain- 
shuffling (PCT/GB91/01134) . The applicants have realised 
that it may be advantageous to apply the 
polycombinatorial approach to humanising by a chain- 
shuffling process in which the VHCDR3 sequence of the 

15 rodent antibody is imprinted upon the human VH segments. 

In this example a mouse anti-HIV gp!20 monoclonal 
antibody was humanised. The VH domain of this mouse 
antibody was fused to the human Cy 1 domain and cloned 
20 into pUC19Sfi/Not/polymyc. 

A repertoire of naive human light chains cloned as 
g3 fusions in fd-DOG was then infected into the cells 
carrying the chimaeric heavy chain, and phage selected on 
25 antigen. These phage have both heavy and light chains on 
their surface, though the phage genome encodes just the 
light chain; this is not a problem since the only heavy 
chain is the one provided. 

30 Light chains selected this way were then paired with 

a library of naive human VH domains, PCR-amplif ied in 
such a way that CDR3 of the human antibodies were 
replaced with that of the original mouse heavy chain. 

35 Section a) deals with construction of a chimaeric 

Fab fragment in which the mouse F58 VH and VL domains are 
fused to human CHI and CK sequences. This clone was used 
in early characterisation of the antibody and served as a 
template for subsequent PCR amplification of the heavy 

40 chain, which was then cloned into pUC19 Sfi-Not polymyc 
as a Pstl-Not I fragment (section b). Section c) 
describes construction of a human light chain repertoire 
in fd-DOG , which was then infected into cells containing 
the chimaeric heavy chain on pUC19 (section d). The 

45 resulting phage were panned against the peptide (section 
e) and the selected light chains PCR-amplif ied and cloned 
as Asc I -Not I fragments alongside the chimaeric heavy 
chain (section f) and assayed for their ability to bind 
antigen by EL1SA ( section g ) . Selected light chains were 

50 recloned in pUC (section h) and naive human VH domains 
amplified with a mutagenic primer imposing the F58 CDR3 
sequence on the domains, and the resulting fragments 
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cloned in phage (section i). This repertoire of 
imprinted heavy chain phage was then used to infect cells 
carrying the selected light chains on pUC and the 
resulting phage panned on antigen. Finally, the selected 
5 heavy and light chains are cloned together on the same 
replicon and assayed for binding to antigen (section j). 

a) Cloning of F58 Chimaeric Heavy Chain 

10 i) cDNA synthesis and primary PCR 

Five ml of cultured hybridoma cells (approximately 2 
x 106 cells) were washed in PBS, pelleted, resuspended in 
200 pi 0.1% diethylpyrocarbonate in water and immediately 

15 boiled for 5 minutes. After centrifugation, 68 pi of the 
•boilate' supernatant was added to a 28 pi reaction 
mixture resulting in a 96 pi reaction fix^re containing 
140mM KC1, 50mM Tris.HCl (pHS.l Q 42'C), 8mM MgCl 2 lOmM 
DTT 500uM deoxythymidine triphosphate, 500pM 

20 deoxycytidine triphosphate, 500pM d e ° x y^nhi?e 
triphosphate and 500pM deoxyguanine triphosphate 
nucleotide triphosphate (500pM dNTPs), 160 units of human 
placental RNAse inhibitor and lOpmol of forward primer. 
Four ul (100 units) of avian myeloblastosis virus ( AMv ) 

25 reverse transcriptase was added, the reaction incubated 

"~ at 42°C for 1 hour, heated to 100°C for 3 minutes, 
quenched on ice and centrifuged for 5 minutes. The 
supernatant was then used immediately for PCR. 



30 



40 



Separate PCR amplifications were performed for the 
heavy and light chains. Fifty pi reaction mixtures were 
prepared containing 5 pi of the supernatant from the cDNA 
Synthesis, 250pM dNTPs , 50mM KC1, lOOmM Tris. HC1 
(PH8.3), 1.5mM MgC12, 175pg/ml BSA, 20pmol each of the 
35 appropriate mixtures of forward and back primers 
(Clackson, T et al. (1991) supra) and lpl (5 units) 
Thermus aquaticus (Taq) DNA polymerase (Cetus, 
Emeryville, CA). The reaction mixture was overlaid with 
paraffin oil and subjected to 30 cycles of amplification 
using a Techne PHC-2 thermal cycler. The cycle was 94 C 
for 1 minute ( denaturation) , 55 °C for 1 minute 
(annealing) and 72°C for 1 minute (extension). The 
product was analyzed by running 5 pi on a 2% agarose gel. 
The remainder was extracted twice with ether, once with 
45 phenol/chloroform, ethanol precipitated and resuspended 
in 50 pi of H 2 0. 

ii) Cloning and sequencing of amplified VH and Vk DNA 

50 The amplified VH DNA was digested with PstI and 

BstEII purified on a 2% low melting point agarose gel 
and ligated into M13VHPCR1 digested with PstI and BstEII 
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(Orlandi, R. , D.H. Gussow, P.T. Jones and G. Winter 
(1989). Cloning immunoglobulin variable domains for 
expression by the polymerase chain reaction. Proc.Natl. 
Acad. Sci., USA 86 (10), 3833-7). The amplified VK DNA 
5 was digested with PvuII and Bgl II and ligated into 
M13VKPCR1 digested with Pvu II and Bel I. The ligation 
mixtures were used to transform competent TGI cells. Two 
clones from separate amplifications were sequenced for 
each VH and Vk chain using the dideoxynucleotide chain 
10 termination method. 

iii) Generation of an Fab construct for expression in 
E.coli 

15 A chimaeric Fab containing the F58 variable domains 

and the human IgGl CHI and Ck domains was constructed by 
ligating the F58 V-domains into a vector containing the 
constant domains. M13VHPCR1 containing the F58 VH was 
digested with PstI and BstEII. The resulting F58VH 

20 fragment was then purified on a 1.5% agarose gel, 
isolated from the gel with Geneclean and ligated into 
pJM-lFabD1.3 digested with PstI and BstEII (This clone 
has the same constant domains and restriction sites as 
FabNQ10.12.5 described in example 1 - in fact 

25 FabNQ10.12.5 was constructed using the constant domains 
pJM-1 FabD1.3. The ligation mixture was used to 
transform competent E.coli N4830-1 cells (Gottesman, M.E. 
Adhya, S. and Das, A. (1980) J.Mol.Biol. 140, 57-75) and 
clones containing the F58 VH identified by restriction 

30 analysis of RF DMA. The F58 Vk was amplified by PCR with 
the primers Vk2BACK and Vk3F0R2 (Clackson, T. et al 
(1991) supra) using M13VkPCR containing the F58 Vk as 
template. The PCR product was digested with SacI and 
Xhol, purified on a 1.5% agarose gel, isolated from the 

35 gel with Geneclean and ligated into pJM-1 Fab vector 
containing the F58 VH digested with SacI and Xhol. The 
ligation mixture was used to transform competent E.coli 
N4830-1 cells and clones containing the F58 Vk identified 
by restriction analysis of RF DNA. 

40 

b) PCR and Cloning of F58 Chimaeric Heavy Chain 

The F58 chimaeric heavy chain was PCR amplified from 
F58 Fab clone DNA, using the procedure described in 

45 example 1 and using the primers VH1BACKSFI15 and 
HUCH1F0RASCN0T (Table 1). The resulting ca. 700bp 
fragment was digested with Pst I and Not I and cloned 
into Pst I and Not I-cut pUC19Sfi/Not/polymyc plasmid 
using standard procedures (Sambrook, J. et al 1989, 

50 supra, and example 1). 

c) Construction of Light Chain Repertoire 
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The light chain repertoire was constructed using the 
same materials and conditions as described in example 2, 
with the exception that the pullthrough was performed 
using the following primers: 

Xchains: HUVLBAAPA (equimolar mix of 1-6) & HUCLFORSERNOT 
Kchains: HUVKBAAPA (equimolar mix of 1-6) & HUCKFORSERNOT 

10 PCR products were digested with Apa LI and Not I 

using the standard format and cloned into Apa LI and Not 
I-cut fd-DOG as described in examples 1&2. Phage were 
prepared from the library clones as described in examples 
1&2 and used to infect cells containing the heavy chain 

15 (see below). 

d) Production of Shuffled Fab Phage 

Cells containing the F58 chimaeric heavy chain were 
20 grown overnight at 37 °C in 2YTAG and 500 ]il added to 
50mls fresh 2YTAmp medium, prewarmed to 37 °C, in a 
conical flask. The cells were grown with shaking to 
0.D. 600 of ca. 0.5 before adding a total of lO" phage 
from the light chain repertoire. The culture was left at 
25 37 °C for 45 minutes without shaking then shaken 
vigorously 'for another 45 minutes at 37 °C before adding 
tetracyline to 15jig/ml and shaking overnight. 

Phage were harvested by PEG precipitation of the 
30 culture supernatant as previously described (examples 
1&2) and used for selection experiments. 

^ e) Panning of Shuffled Fab Phage 

35 This was performed on maxisorb plates as previously 

described (Marks J. et al., 1991, supra), with the 
exception that the tubes were coated with env gpl20 V3 
loop peptide of HIV-1 isolated IIIB dissolved to lOpg/ml 
in water. This peptide was obtained from the AIDS- 

40 directed program (repository ref: ADP737) and has the 
sequence: 

CTRPNNNTRRSIRIQRGPGRAFVTIGKIGNMRQAHCN 

45 The phage eluted from the tubes were used to re- 

infect fresh cells containing the F58 chimaeric heavy 
chain and the panning/re-infection procedure repeated 
another three times. 

50 f ) Recloning of Selected Light Chains 

Selected light chains were PCR-amplif ied from fd-DOG 
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light chain DNA using the procedures described in example 
1 and primers 63 LASCGTGBACK and HUCLFORSERNOT or 
HUCKFORSERNOT • The G3 LASCGTGBACK primer anneals upstream 
of the translational start of the gill signal in fd, and 
5 brings in an Asc I site and a ribosome binding site 
(RBS). These fragments were digested with Asc I and Not 
I and cloned into Asc I and Not I-cleaved pUC19F58 
plasmid (as shown in fig 13) so as to create a cistron r 
enabling soluble Fab to be produced. This was analysed 
10 for peptide binding in ELISA and bound antibody detected 
by virtue of the myc tag peptide on the end of the light 
chain. 



15 



40 



50 



g) ELISA 

This was performed as described below. 



1. Inoculate lOOpl 2xTY, 100 pg/ml ampicillin, 1% 
glucose in 96-well plates ('cell wells 1 , Nuclon) and 

20 grow with shaking (300 rpm) overnight at 37 °C. 

2. Use a 96-well transfer device to transfer small 
inocula from this plate to a second 96-well plate 
containing 200pl fresh 2xTY, lOOpg/ml ampicillin, 

25 0.1% glucose per well. Grow at 37 °C, shaking until 

0.D-600nm is approximately 0.9 (about 3 hrs). To 
the wells of the original plate f add 25}il 60% 
glycerol per well and store at -70 °C. 

30 3. Add 25jil 2xTY, lOOyg/ml ampicillin, 9mM IPTG (final 
concentration 1 mM IPTG). Continue shaking at 30°C 
for a further 16 to 24 hrs. 

4. Spin 4,000 rpm for lOmin and use 100pl supernatant 
35 in ELISA. 

5. Coat plate (Falcon 3912) with 50^1 per well of 
peptide at lOpg/ml in water. Leave overnight at 
room temperature. 



Rinse wells 3x with PBS f and block with 200pl per 
well of 1% BSA/PBS, for 1 hr at 37 °C. 



7. Rinse wells 3x with PBS, then add 25\xl 6% BSA/PBS to 
45 all wells. 

8. Add lOOpl culture supernatant containing soluble Fab 
to the appropriate wells. Mix, leave 1.5 hrs room 
temp . 



Discard test solution, and wash out wells 3 times 
with PBS, 0.05% Tween 20 and 3 times with PBS. 
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Pipette lOOul of 4ug/ml purified 9E10 antibody, in 
2% Marvel/PBS, into each well. Incubate at room 
temp, for 1.5 hrs. 

5 10. Discard 9E10 antibody, and wash out wells with 3 
times with PBS, 0.05% Tween 20 and 3 times with PBS. 
Pipette lOOul of 1:500 dilution of anti-mouse 
antibody ( peroxidase-con j ugated anti-mouse 
immunoglobulins, Dakopats/ICN, or peroxidase 
10 conjugated anti-mouse IgG, Fc-specific, Sigma A- 

2554). Incubate at room temp, for 1.5 hrs. 

11. Discard 2nd antibody, and wash wells 3 times with 
PBS, 0.05% Tween 20 and 3 times with PBS. 

15 12 Add one lOmg ABTS (2,2'-azino bis(3- 
ethylbenzthiazoline-6-sulphonic acid ) diamraonium 
salt) tablet to 20ml 50mM citrate buffer, pH4.5. 
(50mM citrate buffer, pH4.5 is made by mixing equal 

20 volumes 50mM trisodlum citrate and 50mM citric 

acid) . 

13. Add 2ul 30% hydrogen peroxide to the above solution 
immediately before dispensing. 

25 14. Add lOOul of the above solution to each well. Leave 
at room temp. 20-30 mins. 

15. Quench by adding 50ul 3.2mg/ral sodium fluoride. 
30 Read at 405nm. 

» ate i. Alternatively, inoculate clones from 
^formation plate into lOOul 2xTY, lOOug/ml 
ampicillin, 0.1% glucose in 96-well plates 
35 ('cell wells', Nuclon) and grow with shaking 

(300rpm) 37 °C, shaking until O.D. 6 oonm 1S 
approximately 0.9 (about 6 hrs). Continue with 
step 3- 

40 Note 2 : This method is based on that of DeBellis D. and 

Schwartz I., 1990 Nucl. Acids Res. 18: 1311 and 

relies on the low levels of glucose present in 
the starting medium being metabolised by the 
time the inducer (IPTG) is added. 

45 Note 3: 'Marvel' is dried milk powder. PBS is 5.84g 

NaCl 4.72g Na 2 HP0 4 and 2.64g NaH 2 P0 4 .2H 2 0, 

pH7.2, in 1 litre. BSA is Bovine Serum Albumin. 

50 h) Subcloning Selected Light Chains 

Light chains were PCR-amplified from DNA of selected 
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clones using an equimolar mix of the 7 HUVLBASFI primers 
in conjunction with HUCLFORSERASCNOT or an equimolar mix 
of the 6 HUVKBASFI primers in conjunction with 
HUCKFORSERASCNOT as described in example 2. PCR 
5 fragments were cut with Sfi I and Not I and cloned into 
Sfi I and Not I -cut pUC19Sfi/Not/polymyc then transformed 
into TGI. 

The panning/ infection process described above is 
10 essentially repeated again, this time with the positions 
of the heavy and light chains reversed. 

i) CDR- imprinting VH 

15 VH domains were amplified from the pooled primary 

PCR material described in example 2. Six separate 
pullthrough reactions were performed with the mutagenic 
F5 8GRAFT JH4SAL primer and each of the HUVHBAAPA1-6 
primers individually. Conditions for the pullthrough 

20 were the same as in example 2(d) except that the 
annealing temperature was lowered to 45 °C. 

The resulting VH fragments were pooled and cut with 
Apa LI and Sal I using standard conditions and cloned 

25 into Apa LI and Xho I -cut fd-DOG-GlCHl using standard 
protocols '(Sal 1 and Xho I produce compatible CTAG 
overhangs). Phage were prepared from this library as 
described above, this time using the heavy chain phage to 
infect cells carrying the selected light chains expressed 

30 in pUC19Sfi/Not/polymyc. 

j) Screening of Final Heavy-Light Combinations 

The end result of this process is a pool of selected 
35 heavy chains and a pool of selected light chains. These 
are now combined at random. Heavy chain clones are now 
PCR-amplif ied using an equimolar mix of all 6 HUVHBACKSFI 
primers in conjunction HUCH1F0RASCN0T using the procedure 
described in example 1. These fragments are cut with Sfi 
40 I and Asc I and gel-purified using standard procedures 
(Example 1), then ligated to equimolar quantities of Asc 
I -Not I -cut light chains produced in step f) above and 
Sfi I and Not I-cut pUC19Sfi/Not/polymyc vector, also 
produced earlier. Alternatively, these Sfi I-Asc I 
45 fragments replaced the F58 heavy chain in the constructs 
shown at the end of fig 13 (A). These constructs were 
then transformed into TGI and analysed for peptide 
binding activity by ELISA as described above. 

50 The end-products are completely human Fab fragments 

with the same or similar antigen-specificity as the 
parent rodent antibody. 
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Example 5 ^ ^ 

Display of Single Chain Fv and Fab Fragments D erived from the 
Anti-Oxazolone Antibody N010.12 -5 on Bacteriophage fd using 
5 pHENI and fdCAT2 

A range of constructs (see figure 18) were made from 
a clone (essentially construct II in pUC19) designed for 
expression in bacteria of a soluble Fab fragment (Better et 
al. 1988 see above) from the mouse anti-phOx (2-phenyl-5- 

10 oxazolone) antibody NQ10.12.5 (Griffiths, G. M. et al. Nature 
312 , 271-275, 1984). In construct II, the V-regions are 
derived from NQ10.12.5 and attached to human Ck and CHI ( 1 
isotype) constant domains. The C- terminal cysteine residues, 
which normally form a covalent link between light and heavy 

15 antibody chains, have been deleted from both the constant 
domains- To clone heavy and light chain genes together as Fab 
fragments (construct II) or as separate chains (constructs III 
and IV) for phage display, DNA was amplified from construct 
II by PCR to introduce a NotI restriction site at the 3 f end, 

20 and at the 5' end either an ApaLI site (for cloning into fd- 
CAT2) or Sfil sie (for cloning into pHENI). The primers 
FABNOTFOK with VH1BACKAPA (or VH1BACKSFI15 ) were used for PCR 
amplification of genes encoding Fab fragments (construct II), 
the primers FABNOTFOH with VH1BACKAPA (or VH1BACKSFI15) for 

25 heavy chains ('construct III), and the primers FABNOTFOK and 
MVKBAAPA (or MVKBASFI) for light chains (construct IV). 

The single-chain Fv version of NQ10.12.5 (construct I) 
has the heavy (VH) and light chain (Vk) variable domains 
joined by a flexible linker (Gly 4 Ser) 3 (Huston, J. S. et al. 

30 Proc. Natl. Acad. Sci. USA 85 5879-5883, 1988) and was 
constructed from construct II by 'splicing by overlap 
extension 1 as in example 14 of WO 92/01047. The assembled 
genes were reamplified with primers VK3F2N0T and VH1BACKAPA 
(or VH1BACKSFI15 ) to append restriction sites for cloning into 

35 fd-CAT2 (ApaLI-NotI) or pHENI (Sfil-NotI). 

VH1BACKAPA,5'-CAT GAC CAC AGT GCA CA G GT(C/G) (A/C)A(A/G) CTG 
CAG (C/G)AG TC(A/T) GG? 

VHlBACKSFI15,5 f -CAT GCC ATG ACT CGC GGC CCA G CC GGC CAT GGC 
40 C(C/G)A GGT (C/G)(A/C)A (A/G)CT GCA G(C/G)A GTC (A/T)GG; 

FABNOTFOH, 5 1 -CCA CGA TTC T GC GGC CGC TGA AGA TTT GGG CTC AAC 
TTT CTT GTC GAC; 

FABNOTFOK, 5 f -CCA CGA TTC TGC GGC CGC TGA CTC TCC GCG GTT GAA 
GCT CTT TGT GAC; 
45 MVKBAAPA, 5 T -CAC AGT GCA CT C GAC ATT GAG CTC ACC CAG TCT CCA; 
MVKBASFI, 5 1 -CAT GAC CAC G CG GCC CAG CCG GCC ATG GCC GAC ATT 
GAG CTC ACC CAG TCT CCA; 

VK3F2N0T,5'-TTC TGC GGC CGC CCG TTT CAG CTC GAG CTT GGT CCC. 
Restriction sites are underlined - 
50 Rescue of Phage and Phagemid particles 

Constructs I-IV (figure 27) were introduced into both fd-CAT2 
and pHENI. Phage fd-CAT2 (and fd-CAT2-I, II, III or IV) was 
taken from the supernatant of infected E.coli TGI after 
shaking at 37 °C overnight in 2xTY medium with 12.5yg/ml 
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tetracycline, and used directly in ELISA. Phagemid pHENl (and 
pHENl-I and II) in E.coli TGI (supE) were grown overnight in 
2 ml 2xTY medium, 100 pg/ml ampicillin, and 1% glucose 
(without glucose, expression of g3p prevents later 
5 superinfection by helper phage). lOpl of the overnight 
culture was used to innoculate 2 ml of 2xTY medium, lOOpg/ml 
ampicillin, 1% glucose, and shaken at 37 °C for 1 hour. The 
cells were washed and resuspended in 2xTY, 100 pg/ml 
ampicillin, and aphagemid particles rescued by adding 2 pi 
10 (10 8 pfu) VCSM13 helper phage ( Stratagene ) . After growth for 
one hour, 4pl kanamycin (25 mg/ml) was added, and the culture 
grown overnight. The phagemid particles were concentrated 10- 
fold for ELISA by precipitation with polyethylene glycol. 
ELISA 

15 Detection of phage binding to 2-phenyl-5-oxazolone (phOx) was 
performed as in example 9. 96-well plates were coated with 
10 pg/ml phOx-BSA or 10 pg/ml BSA in PBS overnight at room 
temperature, and blocked with PBSS containing 2% skimmed milk 
powder. Phage (mid) supernatant (50 pi) mixed with 50 pi PBS 

20 containing 4% skimmed milk powder was added to the wells and 
assayed. To detect binding of soluble scFv or Fab fragments 
secreted from pHENl, the c-myc peptide tag described by Munro 
and Pelham 1986 supra, was detected using the anti-myc 
monoclonal 9E10 (Evan, G. I. et al. Mol Cell Biol 5 3610-3616, 

25 1985) followed by detection with peroxidase-conjugated goat 
ant i -mouse immonoglobulin. Other details are as in example 
9. 

The constructs in fdDOGl and pHENl display antibody 
fragments of the surface of filamentous phage. The phage 

30 vector, fd-DOGl (figure 16) is based on the vector fd-tet 
(Zacher, A. N. et al. Gene 9 127-140, 1980) and has 
restriction sites (ApaLI and NotI) for cloning antibody genes 
(or other protein) genes for expression as fusions to the N- 
terminus of the phage coat protein g3p. Transcription of the 

35 antibody-g3p fusions in fd-DOGl is driven from the gene III 
promoter and the fusion protein targetted to the periplasm by 
means of the g3p leader. Fab abd scFv fragments of NQ10.12.5 
cloned into fd-DOGl for display were shown to bind to phOx-BSA 
(but not BSA) by ELISA (table 2). Phage were considered to 

40 be binding if A 405 of the sample was at least 10-fold greater 
that the background in ELISA. 

The phagemid vector, pHENl (fig. 17), is based upon 
pUC119 and contains restriction sites (Sfil and NotI) for 
cloning the fusion proteins. Here the transcription of 

45 antibody-g3p fusions is driven from the inducible lacZ 
promoter and the fusion protein targetted to the periplasm by 
means of the pelB leader. Phagemid was rescued with VCSM13 
helper phage in 2xTY medium containing no glucose or IPTG: 
under these conditions there is sufficient expression of 

50 antibody-g3p. Fab and scFv fragments of NQ10.12.5 cloned into 
pHENl for display were shown to bind to phOx-BSA (but not BSA) 
by ELISA (Table 2) using the same criterion as above. 

An alternative methodology for preparing libraries of 
Fab fragments expressed on the surface of phage would be to: 
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1. Prepare a library of phage expressing heavy chain 
(VHCH) genes from inserts in the phage genome. 
2 Prepare a library of light chain genes in a plamid 

expression vector in E.coli, preferably a phagemid, and 
5 isolate the soluble protein light chins expresed from this 

library^^ soluble protein light chains f romt he library 

to the heavy chain library displayed on phage. 

4. Select phage with the desired properties of affinity 

10 and specificity. 

These will encode the heavy chain (VHCH) genes. 
5 Isolate the light chain genes encoding ight chains 

which form suitable antigen binding sites in combination with 
the selected heavy chains, preferably by using superinfection 

15 of bacteria, containing phagemid expressing the light chain, 
with phage expressing the selected heavy chain and then 
assaying for antigen binding. 

Example 16 ... 
^nT^T Phaaemid Encodin g a Gene III Protein Fusion with 

20 Antibody Heavy or Light Chains by Phage — Encoding — the 

com plementary Antibody Ch ain Displayed on Phage and the Use 
nf -this Technique to Make Dual Combinat orial Libraries 

With random combinatorial libraries there is a 
limitation on the potential diversity of displayed Fab 

25 fragments due -to the transformation efficiency of bacterial 
cells. Described here is a strategy (dual combinatorial 
libraries) to overcome this problem, potentially increasing 
the number of phage surveyed by a factor of 10 . 

For assembly of heavy and light chains expresses from 

30 different vectors, phagemid (pHENl-III or IV) was grown in 
E.coli HB2151 (a non-supressor strain) to allow production of 
soluble chains, and rescued as above except that helper Pjage 
were used expressing partner chains as fusions to g3p (10 TU 
fd-DOGl-IV or III respectively) and 2 pi tetracycline (12.5 

35 mg/ml) in place of kanamycin. 

Separate Vectors to Encode Fab Heavy and Light Chains 
The heavy and light chains of Fab fragments can be encoded 
together in the same vector or in. different vectors. To 
demonstrate this the heavy chain (construct III) was cloned 

40 into pHENl (to provide soluble fragments) and the light chain 
(construct IV) into fd-DOGl (to make the fusion with g3p). 
The phagemid pHENl-III, grown in E.coli HB2151 (non-supressor) 
was rescued with fd-DOGl-IV phage, and phage (mid) shown to 
bind to phOx:BSA, but not to BSA (Table 2). This demonstrates 

45 that soluble light chain is correctly associating with the 
heavy chain anchored to the g3p, since neither heavy chain nor 
light chain alone bind antigen (Table 2). 

Similar results were obtained in the reverse experiment 
(with phagemid pHEN-l-IV and fd-CAT2-III phage) in which the 

50 heavy chain was produced as a soluble molecule and the light 
chain anchored to g3p (Table 2). Hence a Fab fragment is 
assembled on the surface of phage by fusion of either heavy 
or light chain to g3p, provided the other chain is secreted 
using the same or another vector (figure 19). 
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The resulting phage population is a mixture of phage 
abd rescued phagemid. The ratio of the two types of particle 
was assessed by infecting log phase E.coli TGI and plating on 
TYE plates with either 15 yg/ml tetracycline (to select for 
5 fd-DOGl) or 100 pg/ml ampicillin (to select for pHENl ) . The 
titre of fd-DOGl phage was 5 x 10 X1 TU/ml and the titre of 
pHENl 2 x 10 10 TU/ml , indicating a packaging ratio of 25 phage 
per phagemid. 

Demonstrated here is an alternative strategy involving 
10 display of the heterodimeric antibody Fab fragments on the 
surface of phage. One of the chains is fused to g3p and the 
other is secreted in soluble form into the periplasmic space 
of the E.coli where it associates non-covalently with the g3p 
fusion , and binds specifically to antigen. Either the light 
15 or heavy chain can be fused to the g3p: they are displayed on 
the phage as Fab fragments and bind antigen (Figure 19). 
Described are both phage and phagemid vectors for surface 
display. Phagemids are probably superior to phage vectors for 
creation of large phage display libraries. Particularly in 
20 view of their higher transfection efficiencies (Two to three 
orders of magnitude higher ), allowing larger libraries to be 
constructed. The phagemid vector , pHENl also allows the 
expression of soluble Fab fragments in non-suppressor E.coli. 

Also demonstrated here is that heavy and light chains 
25 encoded on the same vector (construct II), or on different 
vectors (constructs III and IV) can be displayed as Fab 
fragments. This offers two distinct ways of making random 
combinatorial libraries for display. Libraries of heavy and 
light chain genes, amplified by PCR, could be randomly linked 
30 by a f PCR assembly' process (example 14 of WO 92/01047) based 
on 'splicing by overlap extension' , cloned into phage (mid) 
display vectors and expressed from the same promoter as part 
of the same transcript (construct II) as above, or indeed from 
different promoters as separate transcripts. Here the 
35 phage(mid) vector encodes and displays both chains. For a 
combinatorial library of 10 7 heavy chains and 10 7 light 
chains, the potential diversity of displayed Fab fragments 
(10 14 ) is limited by the transfection efficiency of bacterial 
cells by the vector (about 10 9 clones per pg cut and ligated 
40 plasraid at best) (W.J. Dower et al Nucl. Acids. Res. 16 6127- 
6145, 1988). Libraries thus prepared are analogous to the 
random combinatorial library method described by Huse, W.D. 
et al Science 246 1275-1281 (1989), but have the important 
additional feature that display on the surface of phage gives 
45 a powerful method of selecting antibody specificities from the 
large number of clones generated. 

Alternatively, libraries of heavy and light chains 
could be cloned into different vectors for expression in the 
same cell, with a phage vector encoding the g3p fusion and a 
50 phagemid encoding the soluble chain. The phage acts as a 
helper, and the infected bacteria produced both packaged phage 
and phagemid. Each phage or phagemid displays both chains but 
encodes only one chain and thus only the genetic information 
for half of the antigen-binding site. However, the genes for 
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both antibody chains can be recovered separately by plating 
on the selective medium, suggesting a means by which mutually 
complementary pairs of antigen binding heavy and light chain 
combinations could be selected from random combinatorial 
5 libraries- For example, a light chain repertoire on fd phage 
could be used to infect cells harbouring a library of soluble 
heavy chains on the phagemid. The affinity purified phagemid 
library could then be used to infect E.coli, rescued wxth the 
affinity purified phage library, and the new combinatorial 

10 library subjected to a further round of selection. Thus, 
antibody heavy and light chain genes are reshuffled after each 
round of purification. Finally, after several rounds, 
infected bacteria could be plated and screened individually 
for antigen-binding phage. Such 'dual' combinatorial 

15 libraries are potentially more diverse than those encoded on 
a single vector. By combining separate libraries of 10 light 
chain phage(mid)s, the diversity of displayed Fab fragments 
(potentially 10") is limited only by the number of bacteria 
(10 12 per litre). More simply, the use of two vectors should 

20 also facilitate the construction of 'hierarchical' libraries, 
in which a fixed heavy or light chain is paired with a library 
or partners (example 22), offering a means of * fine-tuning 
antibody affinity and specificity. 
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CLAIMS 

1. A method of producing multimeric specific binding 

pair (sbp) members , which method comprises expressing 
5 from a vector in recombinant host organism cells a 

population of a first polypeptide chain of a specific 
binding pair member fused to a component of a secreted 
replicable genetic display package (rgdp) which thereby 
displays said polypeptide chains at the surface of rgdps, 

10 and combining said population with a population of a 

second polypeptide chain of said specific binding pair 
member by 'causing or allowing first and second 
polypeptide chains to come together to form a library of 
said multimeric specific binding pair members displayed 

15 by rgdps,, said population of second polypeptide chains 
not being expressed from the same vector as said 
population of first polypeptide chains, at least one of 
said populations being genetically diverse and expressed 
from nucleic acid that is capable of being packaged using 

20 said rgdp component, whereby the genetic material of each 
said rgdp encodes a polypeptide chain of a said 
genetically diverse population • 

2. A method according to claim 1 wherein at least one 

25 of said populations is expressed from a phage vector. 
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3. A method according to claim 1 or claim 2 wherein 
at least one of said populations is expressed from a 
phagemid vector , the method including using a helper 
phage or a plasmid expressing complementing phage genes, 
to help package said phagemid genome , and said component 
of the rgdp is a capsid protein therefore. 

4. A method according to any one of claims 1 to 3 
wherein first and second polypeptide chains are expressed 

10 in the same host organism cell. 

5. A method according to any one of the preceding 
claims wherein each said polypeptide chain is expressed 
from nucleic acid which is capable of being packaged as a 

15 rgdp using said component fusion product, whereby 

encoding nucleic acid for both said polypeptide chains is 
packaged in respective rgdps. 

6. A method according to any one of the preceding 
20 claims which comprises introducing vectors capable of 

expressing a population of said first polypeptide chains 
into host organisms which express a population of said 
second polypeptide chains in free form, or introducing 
vectors capable of expressing a population of said second 
25 polypeptide chains in free form into host organisms which 
express a population of said first polypeptide chains. 



5 



SUBSTITUTE SHEET 



WO 92/20791 



PCT/GB92/00883 



80 

7. A method according to any one of claims 1 to 5 
wherein said second polypeptide chains are each expressed 
as a fusion with a component of a rgdp which thereby 

5 displays said second polypeptide chains at the surface of 
rgdps . 

8. A method according to any one of claims 1 to 3 
wherein the population of second polypeptide chains is 

10 not expressed in the same host organism cells as the 
population of first polypeptide chains, the method 
comprising* the following additional steps: 

(a) forming an extracellular mixture of a 
population of soluble second polypeptide chains and rgdps 

15 displaying a population of first polypeptide chains; and 

(b) causing or allowing first and second 
polypeptide chains to come together to form the library 
of said multimeric specific binding pair members. 

20 9. A method according to claim 8 wherein said mixture 

is partially denatured before being renatured to cause or 
allow said first and second polypeptide chains to come 
together to form said library. 

25 10. A method according to claim 9 wherein the 

population of second polypeptide chains comprises a 
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repertoire of polypeptides purified from a human or* 
animal source. 

11. A method according to any one of claims 1 to 10 
5 wherein any of the populations of said polypeptide chains 
is derived from: 

(i) the repertoire of rearranged 

immunoglobulin genes of an animal immunised with 
complementary sbp member; 
10 (ii) the repertoire of rearranged 

immunoglobulin genes of an animal not immunised with 
complementary sbp member; 

(iii) a repertoire of an artificially rearranged 
immunoglobulin gene or genes; 
15 (iv) a repertoire of an immunoglobulin 

homolog gene or genes; or 

(v) a repertoire of sequences derived 
from a germ-line immunoglobulin gene or genes; 

(vi) a repertoire of an immunoglobulin 

20 gene or genes artificially mutated by the introduction of 
one or more point mutations. 

(vii) a mixture of any of (i), (ii), 
(iii), (iv), (v) and (vi). 

25 12. A method according to any one of the preceding 

claims wherein said sbp member comprises a domain which 
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is, or is homologous to, an immunoglobulin domain* 

13. A method according to any one of the preceding 
claims wherein the rgdp is a bacteriophage, the host is a * 

5 bacterium, and said component of the rgdp is a capsid 
protein for the bacteriophage. 

14. A method according to claim 13 wherein the phage 
is a filamentous phage. 

10 

15. A method according to claim 14 wherein the phage 
is selecte'd from the class I phages fd, M13, fl, Ifl, 
Ike, ZJ/Z, Ff and the class II phages Xf, Pfl and Pf3. 

15 16. A method according to claim 14 or claim 15 wherein 
the first polypeptide chains are expressed as fusions 
with the gene III capsid protein of phage fd or its 
counterpart in another filamentous phage. 

20 17. A method according to claim 16 wherein the first 
polypeptide chains are each inserted in the N- terminal 
region of the mature capsid protein downstream of a 
secretory leader peptide. 

25 18. A method according to any one of claims 13 to 17 
wherein the host is E.coli. 
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19. A method according to any one of the preceding 
claims wherein nucleic acid encoding an sbp member 
polypeptide is linked downstream to a viral capsid 
protein through a suppressible translational stop codon. 

5 

20. A method according to any one of the preceding 
claims wherein rgdps formed by said expression are 
selected or screened to provide an individual sbp member 
or a mixed population of said sbp members associated in 

10 their respective rgdps with nucleic acid encoding a 
polypeptide chain thereof. 

21. A method according to claim 20 wherein the rgdps 
are selected by affinity with a member complementary to 

15 said sbp member. 

22. A method according to claim 21 which comprises 
recovering any rgdps bound to said complementary sbp 
member by washing with an eluant. 

20 

23. A method according to claim 22 wherein the eluant 
contains a molecule which compete with said rgdp for 
binding to the complementary sbp member. 

25 24. A method according to any one of claims 21 to 23 
wherein the rgdp is applied to said complementary sbp 
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member in the presence of a molecule which competes with 
said package for binding to said complementary sbp 
member. 

5 25. A method according to any one of claims 20 to 24 
wherein nucleic acid derived from a selected or screened 
rgdp is used to express said sbp member or a fragment or 
derivative thereof in a recombinant host organism. 

10 26. A method according to claim 20 wherein nucleic 
acid from one or more rgdp's is taken and used in a 
further method to obtain an individual sbp member or a 
mixed population of sbp members, or polypeptide chain 
components thereof, or encoding nucleic acid therefor. 

15 

27. A method according to claim 26 wherein the nucleic 
acid taken encodes said first polypeptide chains and is 
introduced into a recombinant vector into which nucleic 
acid from a genetically diverse repertoire of nucleic 

20 acid encoding said second polypeptide chains is also 
introduced, or wherein the nucleic acid taken encodes 
said second polypeptide chains and is introduced into a 
recombinant vector into which nucleic acid from a 
genetically diverse repertoire of nucleic acid encoding 

25 said first polypeptide chains is also introduced. 
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28. A method according to claim 27 which includes the 
step of causing or allowing the recombinant vector to be 
produced by intracellular recombination between a vector 
comprising nucleic acid encoding first polypeptide chain 

5 and a vector comprising nucleic acid encoding a second 
polypeptide chain. 

29. A method according to claim 28 wherein the 
intracellular recombination is promoted by inclusion in 

10 the vectors of sequences at which site-specific 
recombination will occur. 

30. A method according to claim 29 wherein said 
resultant recombinant vector comprises nucleic acid 

15 encoding a single chain Fv region derivative of an 

immunoglobulin resulting from recombination between first 
and second vectors. 

31. A method according to claim 29 or 30 wherein the 
20 sequences at which site-specific recombination will occur 

are loxP sequences obtainable from coliphage PI, and 
site-specific recombination is catalysed by Cre- 
recombinase, also obtainable from coliphage PI. 

25 32. A method according to claim 31 wherein the Cre- 
recombinase used is expressible under the control of a 
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regulatable promoter. 

33. A method according to any one of claims 28 to 32 
wherein the vector comprising nucleic acid encoding the 

5 first polypeptide chain is a phage or phagemid and the 
vector comprising nucleic acid encoding the second 
polypeptide chain is a plasmid, or the vector comprising 
nucleic acid encoding the first polypeptide chain is a 
plasmid and the vector comprising nucleic acid encoding 
10 the second polypeptide chain is a phage or phagemid, and 
the intracellular recombination takes place in a 
bacterial host which replicates plasmids preferentially 
over phages or phagemids, or which replicates phages or 
phagemids preferentially over plasmids. 

15 

34. A method according to claim 33 wherein said 
bacterial host is a PolA strain of E.coli or of another 
grain-negative bacterium. 

20 35. A method of producing multimeric specific binding 
pair (sbp) members, which method comprises 

(i) causing or allowing intracellular 
recombination between (a) first vectors comprising 
nucleic acid encoding a population of a fusion of a first 

25 polypeptide chain of a specific binding pair member and a 
component of a secreted replicable genetic display 
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package (rgdp) and (b) second vectors comprising nucleic 
acid encoding a population of a second polypeptide chain 
of a specific binding pair member, at least one of said 
populations being genetically diverse, the recombination 
5 resulting in recombinant vectors each of which comprises 
nucleic acid encoding a said polypeptide fusion and a 
said second polypeptide chain and capable of being 
packaged using said rgdp component; and 

(ii) expressing said polypeptide fusions and 
10 said second polypeptide chains, producing rgdps which 
display at their surface said first and second 
polypeptide chains and which each comprise nucleic acid 
encoding a said first polypeptide chain and a said second 
polypeptide chain. 

15 

36. A method according to claim 35 wherein the 
intracellular recombination is promoted by inclusion in 
the vectors of sequences at which site- specific 
recombination will occur. 

20 

37. A method according to claim 36 wherein said 
resultant recombinant vector comprises nucleic acid 
encoding a single chain Fv region derivative of an 
immunoglobulin resulting from recombination between first 

25 and second vectors. 
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38. A method according to claim 36 or claim 37 
wherein the sequences at which site-specific 
recombination will occur are loxP sequences obtainable 
from coliphage PI, and site-specific recombination is 

5 catalysed by Cre-recombinase, also obtainable from 
coliphage PI. 

39. A method according to claim 38 wherein the Cre- 
recombinase used is expressible under the control of a 

10 regulatable promoter. 

40. A method according to any one of claims 35 to 39 
wherein the first vectors are phages or phagemids and the 
second vectors are plasmids, or the first vectors are 

15 plasmids and the second vectors are phages or phagemids, 
and the intracellular recombination takes place in a 
bacterial host which replicates plasmids preferentially 
over phages or phagemids, or which replicates phages or 
phagemids preferentially over plasmids. 

20 

41. A method according to claim 39 wherein said 
bacterial host is a PolA strain of E.coli or of another 
grain-negative bacterium. 

25 42. A method according to any one of claims 35 to 41 

wherein nucleic acid from one or more rgdp's is taken and 
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used in a further method to obtain an individual sbp 
member or a mixed population of sbp members , or 
polypeptide chain components thereof, or encoding nucleic 
acid therefor. 

5 

43. A method of producing one or a selected population 
of multichain polypeptide members of a specific binding 
pair (sbp members) specific for a counterpart specific 
binding pair member of interest, which method comprises 
10 the following steps: 

(i) expressing from a vector in recombinant host 
organism cells a genetically diverse population of a 
first polypeptide chain of said multichain protein , fused 
to a component of a replicable genetic display package 

15 (rgdp) which thereby displays said polypeptide chains at 
the surface of rgdps; 

(ii) combining said population with a unique or 
restricted population of second polypeptide chains of 
said multichain sbp members, not being expressed from the 

20 same vector as said population of first polypeptide 
chains, said combining forming a library of said 
multichain sbp members displayed by rgdps, said 
genetically diverse population being expressed from 
nucleic acid which is capable of being packaged using 

25 said rgdp component, whereby the genetic material of each 
said rgdp encodes a said first polypeptide chain; 
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(iii) selecting by affinity with said counterpart sbp 
member of interest multichain sbp members specific for 
said counterpart sbp member associated in their 
respective rgdps with nucleic acid encoding a said first 

5 polypeptide chain thereof; 

(iv) combining said first polypeptide chains of 
multichain sbp members selected in step (iii) with a 
genetically diverse population of second polypeptide 
chains of multichain sbp members, the said second 

10 polypeptide chains being fused to a component of a rgdp 
which thereby displays them at the surface of rgdps, the 
said combining in this step (iv) forming a library of 
multichain sbp members from which one or more multichain 
sbp members specific for said counterpart sbp member are 

15 selectable by affinity with it. 

44. A method according to claim 43 wherein said 
multichain sbp members are antibodies, or other members 
of the immunoglobulin family, or binding fragments 

20 thereof. 

45. A method according to claim 44 wherein each of 
said second chains combined in step (ii) comprise a 
variable domain derived from a non-human animal antibody 

25 specific for the antigen of interest. 



WO 92/20791 



PCT/GB92/00883 



91 

46. A method according to claim 45 wherein said second 
polypeptide chains are chimaeric, comprising a human 
antibody domain. 

5 47. A method according to claim 46 wherein said human 
antibody domain comprises Cyl. 

48. A method according to any one of claims 44 to 47 
comprising an additional step (v) wherein humanised 

10 antibodies for said antigen are selected by affinity with 
it. 

49. A kit for use in carrying out a method according 
to any one of claims 1-48 , said kit having the following 

15 components in additional to ancillary components required 
for carrying out the method: 

(i) a vector having the following features: 
(a) an origin of replication for single-stranded 
bacteriophage, (b) a restriction site for insertion of 

20 nucleic acid encoding or a polypeptide component of an 

sbp member, (c) said restriction site being in the 5' end 
region of the mature coding sequence of a phage capsid 
protein, and (d) with a secretory leader sequence 
upstream of said site which directs a fusion of the 

25 capsid protein and sbp polypeptide to the periplasmic 
space of a bacterial host; and (ii) another vector, 
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having some or all of the features (a), (b), (c) and (d) 
of the vector described in (i). 

50. A kit for use in carrying out a method according 
5 to any one of claims 36 to 41 having the following 

components in addition to ancillary components required 
for carrying out the method: 

(i) a first vector having the following features: 
(a) a restriction site for insertion of nucleic acid 

10 encoding or a polypeptide component of an sbp member , (b) 
said restriction site being in the 5 f end region of the 
mature coding sequence of a phage capsid protein, and (c) 
with a secretory leader sequence upstream of said site 
which directs a fusion of the capsid protein and sbp 

15 polypeptide to the periplasmic space of a bacterial host; 
and 

(ii) a second vector having a restriction site for 
insertion of nucleic acid encoding a second said 
polypeptide chain, 

20 (iii) at least one of the vectors having an origin of 
replication for single-stranded bacteriophage, and 
(iv) the vectors having sequences at which site- 
specific recombination will occur. 
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